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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 4201.. 4425 

(D) OTHER INFORMATION: /product= "Peroxidase 

(ix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(ix) FEATURE: 

(A) NAME / KEY : 

(B) LOCATION: 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(ix) FEATURE: 

(A) NAME /KEY : 

(B) LOCATION: 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

. .6068) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CCATGGCCAG TTGCCGGTGG AGCAGGTAAA AACACCGTAG CGTAGCAGCC AGGCGGAAGC 60 

AGACGCACAG CACAGGTTGG TTATGATAGT CAGCCGGGCC ACATGTGTGT AGTTGGTACA 120 

CTGATACGCT TACACTGTCT CTCCTTTCTT TTTTATTTGT CACCTTTGGT CGAGCTTACA 180 

TAATTGTGTG ACTAAAAAAA GGTCACTTCA TTCAGAAATT TAGGGTTGTG GGAATTTTGG 240 

ATTTTATTGT GTCTGTATAG AGTAGCTATA GCTAGCTAGC TAGATGTGAT GTTAATAATT 300 

ATGACGATGA GATTGGCCCG CTTGGCCGCT TGCATTGTCT CCCTAGCTCA ATAATGTTTT 360 

GAGTTTGTCT TGCCTTTCTT TCAGCTCTAA CAAATTGGAG TAGGGATGAC TGAGATACAT 42 0 

ATATAAAAGC GAAAACCGCT GCTCTCTGTT AATTATTGCA CATCACACAT AGGCCAAGCC 480 

TTAAGGACAA TCAACTAAGG ATGGTAATAA CTAAGGCTAG TGAGGTCGAA CTAGGGATGT .540 

TAATATACTC TAGATTTTAG ACTATAAAAT TTAAGGATCG AATCAGATTA GT ATC GAACT 600 
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intron 
4426. .5058 



exon 

5059 . .5250 



intron 
5251. .5382 



exon 

5383. .5548 



intron 
5549. .5649 



exon 

5650. .6065 



CDS 

join(4201. .4425, 5059.. 5250, 5383.. 5547, 5649 



ATATTTATAT TCATTTCTAA ACTAAATTAA TTAAGCACCC TAAATTATTG TGATGAAGAG 660 

ACATTTCGAT CGTGATCCAT TATTACTCCT TGGTCAAACT AATCTCGTTT TATGTCACTA 720 

TTTCATCATC TTTTTTGCGA ACGGGTTTAT AGCCCGTGTT CCATTATGAG GACATGAACG 780 

GTTTAAACAA AGTT AC AT AT CATCCCAGCT AGCTACCTAG ATTGGAAGCA TGGGTTCGGT 840 

ATATATATAT AGTTTATATA TTTGGTATAT ATATATATAT ATATATATAT ATATATATAT 900 

CACACGTCAG CTTATATTAC GTAAAGTGGG GTTAGTTTTC AAGAAGCGTG GGACCAGTCA 9 60 

CCTCTGCAGT CTGACCTTGG CTTCAGCTTC GACAGCAAAC AGTCATCTCT TGGAAGCTAA 1020 

GG AC AGTCTC CAACAGTCAA CAAAGCAGCG GTCTGCTTGT AGTTCTCCCT TGCACGACCA 1080 

GCTATATCTA GCATCATAAC AACGGTAAGA TCATCTCTAG CACGACAAAC TTAGTTTAAT 1140 

TAATTATGTC TAATCCGTTG TTGTTAGCTT AAACTTTCTA GCCTCCTATG CTAAGAGAGT 1200 

TCTCTAGTTC TACTCAGGTG GATTGATATA TAAATTGGGA ATCTTCTAGG CGTCACAAGG 12 60 

TATGGTACAC ATCAATCAAT GAACGGACAA AGCAACGGTA AGATCCGACC CAGTAAAAGT 1320 

AATAGCGTTA GGGCATGTAC AACCTAGACA CTGATGCACA GTACTCCAAG TATAAGACAC 1380 

AACTAAAACA CAACATAATA AT AC AGTGGT TATATCTAAA ACATGTGTCT TACCATATTC 1440 

ATTGTACCAA TTAGAACATT TAATAAATTA AAGTGACCAA TCAGCTAGCC TCCTGTCTCG 1500 

AACATAGAGC TAAGACATTG TGTCTTCGTC AAGATACATG TCTTAAGTTT TTTTATATTC 1560 

ACTCCCAAAG ACACACTCTA AGACACAACG TAACACACCC ATTGTACATG CTCTTAACCT 162 0 

AAGTTATCAT GGATGAC C AC GCGTGGCAAT TAAAAAAATA ATTTTTGCCT CCTAAAACCT 1680 

CTTTCTTAAT TGGTTCTTGC TTGCAAATCA CCAGCGAACC CATATGAAAG GATGCTCAAA 1740 

ATCTGGCCAC CGCATCAGGG TTGGTGAATG CAACGTAAAA AATAATGCAT AAATCAGCTC 1800 

TCTGATCAGT T AT AT AATC G TGCCTTTTAA TTATTCATGC CAGCTTTATC TG AC TC AC GA 1860 

AATCATTGAT AAATTATTCC TCAGCTGTAT TAGAAAGAGC AGTGTTGTTT AACTTGGAAA 192 0 

GTGATGTGGA AGCGTGTGAT TGCGGTTGAG CTTGTATAGG AGTAAAATGA GGAACAGTAG 1980 

GAAAATAATT TTTTCGGATT AAAACCGGTT GTTTGGACTG CGGCAGATAC AATTCATAGA 2040 

GATAAAAACA CCGTAGAAGT ATTAGAAGCC GATAAAGATT AAACCCAAAT GAACGAACAG 2100 

GCTAAACAAA TCCGGCGCCT CAAAAGTCAA GAGCAGGTAC TGGGCTGTCT TGCACACGTC 2160 

GCTTTTTGTC TCCCCCTGGC CCCTGGGTGA GAGTAGTAGG GATGCTAAAG TTTGCTTTCT 222 0 

CTTTTTGAGG CATGTGATAG GCTCTTGTTA GTTGCTAGGG CTATGTTTAT AATATTTGCG 2280 
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CTTTTACCTA TGTACGTAAG AACC GGATGG AATAATGCTA TGCAGGAACC AATTATGTTT 2340 

GGTCGAAATA TATAGTGACC TATCATAATG TTATCCCTGT TCATGTACCT AGGTGGCTAA 2400 

TG AT AT AC GG CATATGAATA CAGTAATCAT CCAAGCACGT AAAAACTCGC TAGACGTTTA 2460 

TGCCTGCTAG CCTGCTGGGT GTGTAGACTG GAGTACTGGA CAAACATCGC AATACAGAGG 2520 

TACAGTATTT GTCTAGACAA TGATATACAT AGATAAAAAC CACTGTTGTA AC TTGT AAGC 2580 

CACTAGCTCA CGTTCTCCAT GAGCTCTTCT CTCTGCTGTT TCTTCCTCTG CTAACTGCGT 2640 

TATGATATGA CGTCGTATAA ATAATCTCAC AATACTTCCT TATTTTCAGC ATGGCCTCTT 2700 

TTATGTTTAT TTAACAGTAG CAACCAACGC CGCTCGATGT TTCCTTCAAG AAACGGCCAC 27 60 

TCACTATGTG GTGTGCAGAA GAACAAATGT AAGCAGCTCC TACAGGTACC AGTAGTCATG 282 0 

TCAGTGTGGA AGCTTTCCAA CCAACGCCTC CTTCGAGGAA CCTGGTCGTG CTGACATGAA 2880 

TGTAGGCCAT GC AAGC AC AA GCACCTAACG CGAATCATCA CGACGCGCCG TGTACTGGGC 2940 

GTTGGTACAT CACACCCCGC GTTTGACCTG ATCGGAAGCA TGCGTGTGTG TTGGCTGCAG 3000 

GACCGGCTAT AGGTTTCCTG CATTGGACAG CAGAAGCCAG TCATGTTAGG CACTCACGCG 3060 

CTCCTGCCGT TTGATGAATC ATCCGGTCTT TCGTATTGAT CACTAGTTCA CTACGCTGAT 3120 

ATAGCAAATT TTAAGATGTG AAACCACGAG AC GAGCGATA AATCTTAGAC GTTACCTATC 3180 

CATATGAAGC TTGTGC G AAA AAAAGGCGTG CCGCTGTAGC ATCATTCGTA TACACTTTTG 3240 

TCCCCAAAGA CAGGGATACG AATCCATGCT CGACAGAACC CTCCCTTCCC TGCAGATAAC 3300 

GAC AC TTAAG TATAACAAAA GTAGTTGGAT TATTTCAGAA GCAAAATCTC ACTTTTCGCT 33 60 

GGCCTTTTTG T AC TTTGGTT ACTTGAGTTC AGACAGTGTA TGCTATATTG TCATGTGCTG 3420 

CGTAAGGTTT AAATATGGTT C GAC AAATAT ATCAGTATAT CACTACTTTG TTATGGGTGG 3480 

GGCCTAGCAC AAACTTGATA CAGCTAGGAT AAAGTTAGAA CGATGACTGA TCTACTGTAA 3540 

AGCGACACCT GTCCTGTTAT GGTAGTTTAA GTCCATTCCT GGACGACTCC AGATCCAGGA 3600 

TATGATGCTG TTACATAATG CGATTGTTCA CAATAAAATT GCATGATGTT CTTCTACTCT 3660 

TTAGGCAGTT TTGTTCAACA GGCAAGTTGC ATAATGCATG TGCATATATG AGCAGCATAA 3720 

TCATCAATTA ATCATAGGTT CGTCATTTTA GTTTCACTCC TTCACATTAT TCCAGCCCTT 3780 

GAAGAAAAAT GTAGCAGTGC TTGC TGTTTA ATAAGTGGCA GAGCTGTTTT CACTCCACCT 3840 

ACGCTTGTCT AGGACCAAAA TTTTAATCTG TCACTTTGAG CTAAAACTGA AGCACCAAAC 3900 

CGCTACAAAA GAACGTAGGA GCTGAATTGT AACTTGATGG GATTACTATA GCAGTTGCTA 3960 

CAGTTCTAGC TAGCTACCTT ATTCTATACG CATCACCCTA ACAACCCGGC TGACTGCTGC 4020 



4 



ATCTGACCCC ACCGTCCCCT GCTCCAAACC AACTCTCCTT TCCTTGCATG CACTACACCC 4080 

ACTTCCTGCA GCTATATATA CCACCATATG CCCATCTTAT GAAACCATCC ACAAGAGGAG 4140 

AAGAAACAAT CAACCAGCAA CACTCTTCTC TTATAACATA GTACAGCGAA GGTAACTCAC 4200 

ATG GCA ACT TCC ATG GGT TGT CTC GTC TTG CTC TGC CTT GTT TCT TCT 4248 
Met Ala Thr Ser Met Gly Cys Leu Val Leu Leu Cys Leu Val Ser Ser 
15 10 15 

CTC CTT CCC AGT GCC GTC CTT GGC CAC CCA TGG GGT GGC TTG TTC CCA 429 6 

Leu Leu Pro Ser Ala Val Leu Gly His Pro Trp Gly Gly Leu Phe Pro 
20 25 30 

CAG TTC TAT GAC CAT TCG TGC CCC AAG GCG AAG GAG ATT GTG CAG TCC 4344 
Gin Phe Tyr Asp His Ser Cys Pro Lys Ala Lys Glu lie Val Gin Ser 
35 40 45 

ATT GTG GCA CAG GCT GTG GCC AAG GAG ACC AGG ATG GCG GCA TCT TTA 4392 
He Val Ala Gin Ala Val Ala Lys Glu Thr Arg Met Ala Ala Ser Leu 
50 55 60 

GTC AGA CTG CAT TTC CAT GAC TGC TTT GTC AAG GTTCAATTCT GCTTCCTCTG 4445 
Val Arg Leu His Phe His Asp Cys Phe Val Lys 
65 70 75 

TTATGTTCTT TATATTACAT GCTCTGACAA AGCTATAAAG CTTGATACTG CAGTATAATA 4505 

TAACAAGTTA GCTACACAAG TTTTGTACTT CAAGTCTTTT AACTATATGT TGGTGCAATA 4565 

AGATTATGAG TAATCCATAT GAAGGTGTTG CAAGAGAACA TGAAAGGCAA AGATAAACGG 4625 

ATGAACCCAT TACTAGCTTT GGCTGTATCA GACCAATAAC TTGAAATGCA CTTGTGCTAG 4685 

CATGCCTAAG TATTAGAAAA GGTAGCATGG GAGAATCTAT ATTATTTTGG CTAACTTCTT 4745 

TAGTTACTAT TGATTGATGA GAAAGCCTAC CATTGCCCAT GCCAGCCCTA ATGTCCCGGT 4805 

GACATGATTG AGCCAGTACT ATGATTAATT TACTCTATTG TTCTCCTTTT TTGAGTGCTG 4865 

TATAAGATGT CCTTTTTTTG AGCCACTCGA GAAGATGTTT ACTTAACTCT AGTGCGCAAT 492 5 

GATTGGAGCT CTCAGTGCAA CGCATGTGCT CTGTAATCTA CTGTCACCAC TACTCTGTAG 4985 

TGTGTGCTTA AACTCTAAAC TATTCCACGT GGCTAGTAAT TACCAATCAT TTACAACACT 5045 

GTTACATGTG TAG GGC TGC GAT GCT TCG GTG CTG TTG GAC AAC AGC AGC 5094 
Gly Cys Asp Ala Ser Val Leu Leu Asp Asn Ser Ser 
80 85 

AGC ATA GTT AGT GAG AAA GGG TCC AAC CCG AAC AGG AAC TCC CTC AGG 5142 
Ser He Val Ser Glu Lys Gly Ser Asn Pro Asn Arg Asn Ser Leu Arg 
90 95 100 

GGG TTT GAG GTG ATC GAC CAG ATT AAG GCT GCT CTT GAG GCT GCC TGC 5190 
Gly Phe Glu Val He Asp Gin He Lys Ala Ala Leu Glu Ala Ala Cys 
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105 110 115 

CCA GGC AC A GTC TCC TGT GCC GAC ATT GTT GCC CTT GCG GCT CGT GAT 523 8 

Pro Gly Thr Val Ser Cys Ala Asp lie Val Ala Leu Ala Ala Arg Asp 
120 125 130 135 

TCC ACC GCC CTG GTATGTTCCA CTATCGACAA TCCTTTCCAA CCTCAAGGAA 529 0 

Ser Thr Ala Leu 



CAGACATGAT ATTTGTGTGT GTGTGTGTGT GTATATATAT ATATAGTGAT AGCTTTGGCA 53 50 

AACTTAGATA TTTTCTGAGC TCTAAACCGT AG GTT GGT GGA CCA TAC TGG GAC 5403 

Val Gly Gly Pro Tyr Trp Asp 
140 145 

GTG CCA CTT GGC CGG AGA GAC TCG CTC GGT GCA AGC ATC CAG GGC TCC 5451 
Val Pro Leu Gly Arg Arg Asp Ser Leu Gly Ala Ser lie Gin Gly Ser 
150 155 160 

AAC AAT GAC ATC CCA GCC CCC AAC AAC AC A CTC CCC ACT ATC ATC ACC 5499 
Asn Asn Asp lie Pro Ala Pro Asn Asn Thr Leu Pro Thr lie lie Thr 
165 170 175 

AAG TTC AAG CGC CAG GGC CTC AAT GTT GTT GAT GTT GTC GCC CTC TCA 5547 
Lys Phe Lys Arg Gin Gly Leu Asn Val Val Asp Val Val Ala Leu Ser 
180 185 190 

GGTGATTTTT C TTGTATTT A TTAGTAACAT CTGTCCTTCG TTATTCACCA ACTTAGCGCA 5607 

CACTCATATT ACGCATGGAT ACAATATCAT GTGTGAATAC A GGT GGT CAC ACC 5660 

Gly Gly His Thr 
195 

ATT GGT ATG TCT CGG TGC ACT AGT TTC CGG CAG AGG CTA TAC AAC CAG 5708 
lie Gly Met Ser Arg Cys Thr Ser Phe Arg Gin Arg Leu Tyr Asn Gin 
200 205 210 

AC A GGC AAT GGC ATG GCT GAC AGC ACA CTG GAT GTA TCC TAC GCC GCA 5756 
Thr Gly Asn Gly Met Ala Asp Ser Thr Leu Asp Val Ser Tyr Ala Ala 
215 220 225 230 

AAG CTG AGG CAG GGA TGC CCC CGC TCT GGT GGT GAC AAC AAC CTC TTC 5804 
Lys Leu Arg Gin Gly Cys Pro Arg Ser Gly Gly Asp Asn Asn Leu Phe 
235 240 245 

CCC TTG GAC TTC ATC ACC CCT GCC AAG TTT GAC AAT TTT TAC TAC AAG 5852 
Pro Leu Asp Phe lie Thr Pro Ala Lys Phe Asp Asn Phe Tyr Tyr Lys 
250 255 260 

AAC CTC CTG GCC GGC AAG GGC CTT CTA AGC TCT GAT GAG ATT CTG TTA 5900 
Asn Leu Leu Ala Gly Lys Gly Leu Leu Ser Ser Asp Glu lie Leu Leu 
265 270 275 

ACC AAG AGC GCT GAG ACA GCG GCC CTC GTG AAG GCA TAT GCT GCT GAT 5948 
Thr Lys Ser Ala Glu Thr Ala Ala Leu Val Lys Ala Tyr Ala Ala Asp 
280 285 290 
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GTC AAT CTC TTC TTC CAG CAC TTT GCA CAG TCT ATG GTG AAT ATG GGA 5996 
Val Asn Leu Phe Phe Gin His Phe Ala Gin Ser Met Val Asn Met Gly 
295 300 305 310 

AAC ATC TCG CCA CTG ACA GGG TCA CAA GGT GAG ATC AGG AAG AAC TGC 6044 
Asn lie Ser Pro Leu Thr Gly Ser Gin Gly Glu lie Arg Lys Asn Cys 
315 320 325 

AGG AGG CTC AAC AAT GAC CAC TGA GGGC AC TGAA GTCGCTTGAT GTGCTGAATT 6098 
Arg Arg Leu Asn Asn Asp His * 
330 

GTTCGTGATG TTGGTGGCGT ATTTTGTTTA AATAAGTAAG CATGGCTGTG ATTTTATCAT 6158 

ATGATCGATC TTTGGGGTTT TATTTAACAC ATTGTAAAAT GTGTATCTAT TAATAACTCA 6218 

ATGTATAAGA TGTGTTCATT CTTCGGTTGC CATAGATCTG CTTATTTGAC CTGTGATGTT 6278 

TTGACTCCAA AAACCAAAAT CACAACTCAA TAAACTCATG GAATATGTCC ACCTGTTTCT 63 3 8 

TGAAGAGTTC ATCTACCATT CCAGTTGGCA TTTATCAGTG TTGCAGCGGC GCTGTGCTTT 6398 

GTAACATAAC AATTGTTCAC GGCATATATC CAAATCTAGA GGCCTACCAA AATGAGATAA 6458 

CAAGCCAACT AATCTGCTGG GAAATAGGTA ACAAGTCTCT AACAAGATCC GTTGACCTGC 6518 

AGGTCGACCT CGAGGGGGGG CCCGGTACCC AA 65 5 0 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Ala Thr Ser Met Gly Cys Leu Val Leu Leu Cys Leu Val Ser Ser 
15 10 15 

Leu Leu Pro Ser Ala Val Leu Gly His Pro Trp Gly Gly Leu Phe Pro 
20 25 30 

Gin Phe Tyr Asp His Ser Cys Pro Lys Ala Lys Glu lie Val Gin Ser 
35 40 45 

He Val Ala Gin Ala Val Ala Lys Glu Thr Arg Met Ala Ala Ser Leu 
50 55 60 

Val Arg Leu His Phe His Asp Cys Phe Val Lys Gly Cys Asp Ala Ser 
65 70 75 80 

Val Leu Leu Asp Asn Ser Ser Ser He Val Ser Glu Lys Gly Ser Asn 
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85 90 95 

Pro Asn Arg Asn Ser Leu Arg Gly Phe Glu Val He Asp Gin He Lys 
100 105 110 

Ala Ala Leu Glu Ala Ala Cys Pro Gly Thr Val Ser Cys Ala Asp He 
115 120 125 

Val Ala Leu Ala Ala Arg Asp Ser Thr Ala Leu Val Gly Gly Pro Tyr 
130 135 140 

Trp Asp Val Pro Leu Gly Arg Arg Asp Ser Leu Gly Ala Ser He Gin 
145 150 155 160 

Gly Ser Asn Asn Asp He Pro Ala Pro Asn Asn Thr Leu Pro Thr He 
165 170 175 

He Thr Lys Phe Lys Arg Gin Gly Leu Asn Val Val Asp Val Val Ala 
180 185 190 

Leu Ser Gly Gly His Thr He Gly Met Ser Arg Cys Thr Ser Phe Arg 
195 200 205 

Gin Arg Leu Tyr Asn Gin Thr Gly Asn Gly Met Ala Asp Ser Thr Leu 
210 215 220 

Asp Val Ser Tyr Ala Ala Lys Leu Arg Gin Gly Cys Pro Arg Ser Gly 
225 230 235 240 

Gly Asp Asn Asn Leu Phe Pro Leu Asp Phe He Thr Pro Ala Lys Phe 
245 250 255 

Asp Asn Phe Tyr Tyr Lys Asn Leu Leu Ala Gly Lys Gly Leu Leu Ser 
260 265 270 

Ser Asp Glu He Leu Leu Thr Lys Ser Ala Glu Thr Ala Ala Leu Val 
275 280 285 

Lys Ala Tyr Ala Ala Asp Val Asn Leu Phe Phe Gin His Phe Ala Gin 
290 295 300 

Ser Met Val Asn Met Gly Asn He Ser Pro Leu Thr Gly Ser Gin Gly 
305 310 315 320 

Glu He Arg Lys Asn Cys Arg Arg Leu Asn Asn Asp His * 
325 330 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TTYCAYGAYT GYTTYGTYAA YGGBTG 2 6 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (synthetic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
SGTRTGSGCS CCGSWSAGVG CSAC 24 
(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATCAACCAGC AACACTCTTC TCTTATAACA TAGTACAGCG AAGGTAACTC ACATGGCAAC 60 

TTCCATGGGT TGTCTCGTCT TGCTCTGCCT TGTTTCTTCT CTCCTTCCCA GTGCCGTCCT 120 

TGGCCACCCA TGGGGTGGCT TGTTCCCACA GTTCTATGAC CATTCGTGCC CCAAGGCGAA 180 

GGAGATTGTG CAGTCCATTG TGGCACAGGC TGTGGCCAAG GAGACCAGGA TGGCGGCATC 240 

TTTAGTCAGA CTGCATTTCC ATGACTGCTT TGTCAAGGGC TGCGATGCTT CGGTGCTGTT 300 

GGACAACAGC AGCAGCATAG TTAGTGAGAA AGGGTCCAAC CCGAACAGGA ACTCCCTCAG 360 

GGGGTTTGAG GTGATCGACC AGATTAAGGC TGCTCTTGAG GCTGCCTGCC CAGGCACAGT 420 

CTCCTGTGCC GACATTGTTG CCCTTGCGGC TCGTGATTCC ACCGCCCTGG TTGGTGGACC 480 

ATACTGGGAC GTGC CACTTG GCCGGAGAGA CTCGCTCGGT GCAAGCATCC AGGGCTCCAA 540 
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CAATGACATC CCAGCCCCCA ACAACACACT 
GGGCCTCAAT GTTGTTGATG TTGTCGCCCT 
GTGCACTAGT TTCCGGCAGA GGCTATACAA 
AC TGGATGT A TCCTACGCCG CAAAGCTGAG 
CAACCTCTTC CCCTTGGACT TCATCACCCC 
CCTCCTGGCC GGCAAGGGCC TTCTAAGCTC 
GACAGCGGCC CTCGTGAAGG CATATGCTGC 
ACAGTCTATG GTGAATATGG GAAACATCTC 
GAAGAACTGC AGGAGGCTCA ACAATGACCA 
AATTGTTCGT GATGTTGGTG GCGTATTTTG 
TCATATGATC GATCTTTGGG GTTTTATTTA 
CTCAATGTAT AAGATGTGTT CATTCTTCGG 
TGTTTTGACT CCAAAAACCA AAATCACAAC 
TTCTTGAAAA AAAAAAAAAA AAAAAAAAAA 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



CCCCACTATC ATCACCAAGT TCAAGCGCCA 
CTCAGGTGGT CACACCATTG GTATGTCTCG 
CCAGACAGGC AATGGCATGG CTGACAGCAC 
GCAGGGATGC CCCCGCTCTG GTGGTGACAA 
TGCCAAGTTT GACAATTTTT ACTACAAGAA 
TGATGAGATT CTGTTAACCA AGAGC GCTGA 
TGATGTCAAT CTCTTCTTCC AGCACTTTGC 
GCCACTGACA GGGTCACAAG GTGAGATCAG 
CTGAGGGCAC TGAAGTCGCT TGATGTGCTG 
TTTAAATAAG TAAGCATGGC TGTGATTTTA 
ACACATTGTA AAATGTGTAT CTATTAATAA 
TTGCCATAGA TCTGCTTATT TGACCTGTGA 
TCAATAAACT CATGGAATAT GTCCACCTGT 
AAAA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GTCATAGAAC TGTGGG 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



ATAACATAGT ACAGCG 16 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GGGCCCGCTA GCGGTACCCC C GGGGTC G AC CATGGTCCGT CCTGTAGAAA CCCCAACCCG 60 

TGAAATCAAA AAACTCGACG GCCTGTGGGC ATTCAGTCTG GATCGCGAAA ACTGTGGAAT 12 0 

TGATCAGCGT TGGTGGGAAA GCGCGTTACA AGAAAGCCGG GCAATTGCTG TGCCAGGCAG 180 

TTTTAACGAT CAGTTCGCCG ATGCAGATAT TCGTAATTAT GCGGGCAACG TCTGGTATCA 240 

GCGCGAAGTC TTTATACCGA AAGGTTGGGC AGGCCAGCGT ATCGTGCTGC GTTTCGATGC 300 

GGTCACTCAT TACGGCAAAG TGTGGGTCAA TAATCAGGAA GTGATGGAGC ATCAGGGCGG 3 60 

CTATACGCCA TTTGAAGCCG ATGTCACGCC GTATGTTATT GCCGGGAAAA GTGTACGTAT 420 

CACCGTTTGT GTGAACAACG AACTGAACTG GCAGACTATC CCGCCGGGAA TGGTGATTAC 480 

CGACGAAAAC GGCAAGAAAA AGCAGTCTTA CTTCCATGAT TTCTTTAACT ATGCCGGAAT 540 

CCATCGCAGC GTAATGCTCT ACACCACGCC GAACACCTGG GTGGACGATA TCACCGTGGT 600 

GACGCATGTC GCGCAAGACT GTAACCACGC GTCTGTTGAC TGGCAGGTGG TGGCCAATGG 660 

TGATGTCAGC GTTGAACTGC GTGATGC GGA TCAACAGGTG GTTGCAACTG GACAAGGCAC 720 

TAGC GGGACT TTGCAAGTGG TGAATCCGCA CCTCTGGCAA CCGGGTGAAG GTTATCTCTA 780 

TGAACTGTGC GTCACAGCCA AAAGCCAGAC AGAGTGTGAT ATCTACCCGC TTCGCGTCGG 840 

CATCCGGTCA GTGGCAGTGA AGGGCGAACA GTTCCTGATT AACCACAAAC CGTTCTACTT 900 

TACTGGCTTT GGTCGTCATG AAGATGC GGA CTTACGTGGC AAAGGATTC G ATAACGTGCT 9 60 

GATGGTGCAC GACCACGCAT TAATGGACTG GATTGGGGCC AACTCCTACC GTACCTCGCA 1020 

TTACCCTTAC GCTGAAGAGA TGCTCGACTG GGCAGATGAA CATGGCATCG TGGTGATTGA 1080 

TGAAAC TGCT GCTGTCGGCT TTAACCTCTC TTTAGGCATT GGTTTCGAAG CGGGCAACAA 1140 
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GCCGAAAGAA CTGTACAGCG AAGAGGCAGT CAACGGGGAA ACTCAGCAAG CGCACTTACA 1200 

GGC GATT AAA GAGCTGATAG CGCGTGACAA AAACCACCCA AGCGTGGTGA TGTGGAGTAT 12 60 

TGCCAACGAA CCGGATACCC GTCCGCAAGT GCACGGGAAT ATTTCGCCAC TGGCGGAAGC 1320 

AAC GC GT AAA CTCGACCCGA CGCGTCCGAT CACCTGCGTC AATGTAATGT TCTGCGACGC 1380 

TCACACCGAT ACCATCAGCG ATCTCTTTGA TGTGCTGTGC CTGAACCGTT ATT AC GGATG 1440 

GTATGTCCAA AGC GGC GATT TGGAAACGGC AGAGAAGGTA CTGGAAAAAG AACTTCTGGC 1500 

CTGGCAGGAG AAACTGCATC AGCC GATTAT CATC AC C G AA TACGGCGTGG ATACGTTAGC 1560 

CGGGCTGCAC TCAATGTACA CCGACATGTG GAGTGAAGAG TATCAGTGTG CATGGCTGGA 1620 

TATGTATCAC CGCGTCTTTG ATCGCGTCAG CGCCGTCGTC GGTGAACAGG TATGGAATTT 1680 

CGCCGATTTT GCGACCTCGC AAGGCATATT GCGCGTTGGC GGTAACAAGA AAGGGATCTT 1740 

CACTCGCGAC CGCAAACCGA AGTCGGCGGC TTTTCTGCTG CAAAAACGCT GGACTGGCAT 1800 

GAACTTCGGT GAAAAACCGC AGCAGGGAGG CAAACAATGA ATCAACAACT CTCCTGGCGC 1860 

ACCATCGTCG GCTACAGCCT CGGTGGGGAA TTGGAGCTCG AATTTCCCCG ATCGTTCAAA 1920 

CATTTGGCAA TAAAGTTTCT TAAGATTGAA TCCTGTTGCC GGTCTTGCGA TGATTATCAT 1980 

ATAATTTCTG TTGAATTACG TTAAGCATGT AATAATTAAC ATGTAATGCA TGACGTTATT 2040 

TATGAGATGG GTTTTTATGA TTAGAGTCCC GCAATTATAC ATTTAATACG C G AT AG AAAA 2100 

CAAAATATAG CGCGCAAACT AGGATAAATT ATCGCGCGCG GTGTCATCTA TGTTACTAGA 2160 

TCGATCGGGA ATTAAGCTTA GATCTGCATG GGTGGAGACT TTTCAACAAA GGGTAATATC 2220 

CGGAAACCTC CTCGGATTCC ATTGCCCAGC TATCTGTCAC TTTATTGTGA AGATAGTGGA 2280 

AAAGGAAGGT GGCTCCTACA AATGC CATC A TTGCGATAAA GGAAAGGCCA TCGTTGAAGA 2340 

TGCCTCTGCC GACAGTGGTC CCAAAGATGG ACCCCCACCC ACGAGGAGCA TCGTGGAAAA 2400 

AGAAGACGTT CCAACCACGT CTTCAAAGCA AGTGGATTGA TGTGATCATC GATGGAGACT 2460 

TTTCAACAAA GGGTAATATC CGGAAACCTC CTCGGATTCC ATTGCCCAGC TATCTGTCAC 2520 

TTTATTGTGA AGATAGTGGA AAAGGAAGGT GGCTCCTACA AATGCCATCA TTGCGATAAA 2580 

GGAAAGGCCA TCGTTGAAGA TGCCTCTGCC GACAGTGGTC CCAAAGATGG ACCCCCACCC 2640 

ACGAGGAGCA TCGTGGAAAA AGAAGACGTT CCAACCACGT CTTCAAAGCA AGTGGATTGA 27 00 

TGTGATATCT CCACTGACGT AAGGGATGAC GCACAATCCC ACTATCCTTC GCAAGACCCT 2760 

TCCTCTATAT AAGGAAGTTC ATTTCATTTG GAGAGAACAC GGGGGACTCT AGAGGATCCA 2820 
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GCTGAAGGCT CGACAAGGCA GTCCACGGAG GAGCTGATAT TTGGTGGACA AGCTGTGGAT 2880 

AGGAGCAACC CTATCCCTAA TATACCAGCA CCACCAAGTC AGGGCAATCC CCAGATCAAG 2940 

TGCAAAGGTC CGCCTTGTTT CTCCTCTGTC TCTTGATCTG ACTAATCTTG GTTTATGATT 3000 

CGTTGAGTAA TTTTGGGGAA AGCTCCTTTG CTGCTCCACA CATGTCCATT CGAATTTTAC 3060 

CGTGTTTAGC AAGGGCGAAA AGTTTGCATC TTGATGATTT AGCTTGACTA TGCGATTGCT 312 0 

TTCCTGGACC CGTGCAGCTG CGCTCGGATC TGGGGCCATT TG^TCCAGGC AC GGGAT AAG 3180 

CATTCAGCCA TGGCAGACGC CAAAAACATA AAG AAAGGC C CGGCGCCATT CTATCCTCTA 3240 

GAGGATGGAA CCGCTGGAGA GCAACTGCAT AAGGCTATGA AGAGATACGC CCTGGTTCCT 33 00 

GGAACAATTG CTTTTACAGA TGCACATATC GAGGTGAACA TCACGTACGC GGAATACTTC 33 60 

GAAATGTCCG TTCGGTTGGC AGAAGCTATG AAACGATATG GGCTGAATAC AAATCACAGA 3420 

ATCGTCGTAT GCAGTGAAAA CTCTCTTCAA TTCTTTATGC CGGTGTTGGG CGCGTTATTT 3480 

ATCGGAGTTG CAGTTGCGCC CGCGAACGAC ATTTATAATG AAC GTGAATT GCTCAACAGT 3540 

ATGAACATTT CGCAGCCTAC CGTAGTGTTT GTTTCCAAAA AGGGGTTGCA AAAAATTTTG 3 600 

AAC GTGC AAA AAAAATTACC AATAATC C AG AAAATTATTA TCATGGATTC TAAAACGGAT 3 6 60 

TACCAGGGAT TTCAGTCGAT GTACACGTTC GTCACATCTC ATCTACCTCC CGGTTTTAAT 372 0 

GAAT AC GATT TTGTACCAGA GTCCTTTGAT CGTGACAAAA CAATTGCACT GATAATGAAT 3780 

TCCTCTGGAT CTACTGGGTT ACCTAAGGGT GTGGCCCTTC CGCATAGAAC TGCCTGCGTC 3 840 

AGATTCTCGC ATGCCAGAGA TCCTATTTTT GGCAATCAAA TCATTCCGGA TACTGCGATT 39 00 

TTAAGTGTTG TTCCATTCCA TCACGGTTTT GGAATGTTTA CTACACTCGG ATATTTGATA 3960 

TGTGGATTTC GAGTCGTCTT AATGTATAGA TTTGAAGAAG AGCTGTTTTT ACGATCCCTT 4020 

CAGGATTACA AAATTCAAAG TGCGTTGCTA GTACCAACCC TATTTTCATT CTTCGCCAAA 4080 

AGCACTCTGA TTGACAAATA CGATTTATCT AATTTACACG AAATTGCTTC TGGGGGCGCA 4140 

CCTCTTTCGA AAGAAGTCGG GGAAGCGGTT GCAAAACGCT TCCATCTTCC AGGGATACGA 42 00 

CAAGGATATG GGCTCACTGA GACTACATCA GC T ATTCTG A TTACACCCGA GGGGGATGAT 42 60 

AAACCGGGCG CGGTCGGTAA AGTTGTTC C A TTTTTTGAAG CGAAGGTTGT GGATCTGGAT 432 0 

ACCGGGAAAA CGCTGGGCGT TAATCAGAGA GGC GAATT AT GTGTCAGAGG AC CT ATGATT 43 80 

ATGTCC GGTT ATGTAAACAA TCCGGAAGCG ACCAACGCCT TGATTGACAA GGATGGATGG 4440 

CT AC ATTCTG GAGACATAGC TTACTGGGAC GAAGACGAAC ACTTCTTCAT AGTTGACCGC 4500 

TTGAAGTCTT TAATTAAATA CAAAGGATAT CAGGTGGCCC CCGCTGAATT GGAATCGATA 4560 
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TTGTTACAAC ACCCCAACAT CTTCGACGCG GGCGTGGCAG GTCTTCCCGA CGATGACGCC 4620 

GGTGAACTTC CCGCCGCCGT TGTTGTTTTG GAGCACGGAA AGACGATGAC GGAAAAAGAG 4680 

ATCGTGGATT ACGTCGCCAG TCAAGTAACA ACCGCGAAAA AGTTGCGCGG AGGAGTTGTG 4740 

TTTGTGGACG AAGTACCGAA AGGTCTTACC GGAAAACTCG ACGCAAGAAA AATCAGAGAG 4800 

ATCCTCATAA AGGCCAAGAA GGGCGGAAAG TCCAAATTGT AAAATGTAAC TGTATTCAGC 48 60 

GATGAC G AAA TTCTTAGCTA TTGTAATCAG ATCCGCGAAT TTCCCCGATC GTTCAAACAT 492 0 

TTGGCAATAA AGTTTCTTAA GATTGAATCC TGTTGC CGGT CTTGCGATGA TTATCATATA 4980 

ATTTCTGTTG AATT AC GTT A AGCATGTAAT AATTAACATG TAATGCATGA CGTTATTTAT 5040 

GAGATGGGTT TTTATGATTA GAGTCCCGCA ATTATACATT TAATACGCGA TAGAAAACAA 5100 

AATATAGCGC GCAAACTAGG ATAAATTATC GCGCGCGGTG TCATCTATGT TACTAGATCG 5160 

ATCGGGAATT GAGATCTCAT ATGTCGAGCT CGGGGATCTC CTTTGCCCCA GAGATCACAA 5220 

TGGACGACTT CCTCTATCTC TACGATCTAG TCAGGAAGTT CGACGGAGAA GGTGACGATA 5280 

CCATGTTCAC CACTGATAAT GAGAAGATTA GCCTTTTCAA TTTCAGAAAG AATGCTAACC 5340 

CACAGATGGT TAGAGAGGCT TACGCAGCAG GTCTCATCAA GAC GATCTAC CCGAGCAATA 5400 

ATCTCCAGGA GATCAAATAC CTTCCCAAGA AGGTTAAAGA TGCAGTCAAA AGATTCAGGA 5460 

CTAACTGCAT CAAGAACACA GAGAAAGATA TATTTCTCAA GATCAGAAGT ACTATTCCAG 5520 

TATGGACGAT TCAAGGCTTG CTTCACAAAC CAAGGCAAGT AATAGAGATT GGAGTCTCTA 5580 

AAAAGGTAGT TCCCACTGAA TCAAAGGCCA TGGAGTCAAA GATTCAAATA GAGGACCTAA 5640 

CAGAACTCGC CGTAAAGACT GGCGAACAGT TCCATCGATG ATTGAGACTT TTCAACAAAG 57 00 

GGTAATATCC GGAAACCTCC TCGGATTCCA TTGCCCAGCT ATCTGTCACT TTATTGTGAA 57 60 

GATAGTGGAA AAGGAAGGTG GCTCCTACAA ATGCCATCAT TGCGATAAAG GAAAGGC C AT 5820 

CGTTGAAGAT GCCTCTGCCG ACAGTGGTCC CAAAGATGGA CCCCCACCCA CGAGGAGCAT 5880 

CGTGGAAAAA GAAGACGTTC CAACCACGTC TTCAAAGCAA GTGGATTGAT GTGATATCTC 5940 

CACTGACGTA AGGGATGACG CACAATCCCA CTATCCTTCG CAAGACCCTT CCTCTATATA 6000 

AGGAAGTTCA TTTCATTTGG AGAGGACACG CTGACAAGCT CGGATCCTTT AGCATGATTG 6060 

AACAAGATGG ATTGC AC GC A GGTTCTCCGG CCGCTTGGGT GGAGAGGCTA TTCGGCTATG 6120 

ACTGGGCACA ACAGACAATC GGCTGCTCTG ATGCCGCCGT GTTCCGGCTG TCAGCGCAGG 6180 

GGCGCCCGGT TCTTTTTGTC AAGAC CGACC TGTCCGGTGC CCTGAATGAA CTGCAGGACG 6240 
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AGGCAGCGCG GCTATCGTGG CTGGCCACGA CGGGCGTTCC TTGCGCAGCT GTGCTCGACG 63 00 

TTGTCACTGA AGCGGGAAGG GACTGGCTGC T ATTGGGC G A AGTGCCGGGG CAGGATCTCC 63 60 

TGTCATCTCA CCTTGCTCCT GCCGAGAAAG TATCCATCAT GGCTGATGCA ATGCGGCGGC 6420 

TGCATACGCT TGATCCGGCT ACCTGCCCAT TCGACCACCA AGCGAAACAT CGCATCGAGC 6480 

GAGCACGTAC TCGGATGGAA GCCGGTCTTG TCGATCAGGA TGATCTGGAC GAAGAGCATC 6540 

AGGGGCTCGC GCCAGCCGAA CTGTTCGCCA GGCTCAAGGC GCGCATGCCC GAC GGCGAGG 6600 

ATCTCGTCGT GACCCATGGC GATGCCTGCT TGCCGAATAT CATGGTGGAA AATGGCCGCT 6660 

TTTCTGGATT CATCGACTGT GGCCGGCTGG GTGTGGCGGA CCGCTATCAG GACATAGCGT 67 2 0 

TGGCTACCCG TGATATTGCT GAAGAGCTTG GCGGCGAATG GGCTGACCGC TTCCTCGTGC 67 80 

TTTACGGTAT CGCCGCTCCC GATTCGCAGC GCATCGCCTT CTATCGCCTT CTTGACGAGT 6840 

TCTTCTGAGC GGGACTCTGG GGTTC GAAAT GACCGACCAA GCGACGCCCA ACCTGCCATC 6900 

ACGAGATTTC GATTCCACCG CCGCCTTCTA TGAAAGGTTG GGCTTCGGAA TCGTTTTCCG 69 60 

GGACGCCGGC TGGATGATCC TCCAGCGCGG GGATCTCATG CTGGAGTTCT TCGCCCACCC 702 0 

CAACAGAGGT GGATGGACAG ACCCGTTCTT ACACCGGACT GGGCGCGGGA TAGGATATTC 7080 

AGATTGGGAT GGGATTGAGC TTAAAGCCGG CGCTGAGACC ATGCTCAAGG TAGGCAATGT 7140 

CCTCAGCGTC GAGCCCGGCA TCTATGTCGA GGGCATTGGT GGAGCGCGCT TCGGGGATAC 7200 

CGTGCTTGTA ACTGAGACCG GATATGAGGC CCTCACTCCG CTTGATCTTG GCAAAGATAT 7260 

TTGACGCATT TATTAGTATG TGTTAATTTT CATTTGCAGT GCAGTATTTT CTATTCGATC 732 0 

TTTATGTAAT TCGTTACAAT TAATAAATAT TCAAATCAGA TTATTGACTG TCATTTGTAT 7380 

CAAATCGTGT TTAATGGATA TTTTTATTAT AATATTGATG ATATCTCAAT CAAAACGTAG 7440 

ATAATAATAA TATTTATTTA ATATTTTTGC GTCGCACAGT GAAAATCTAT ATGAGATTAC 7500 

AAAATACCGA CAACATTATT TAAGATACAT AGACATTAAC CCTGAGACTG TTGGACATCA 7560 

ACGGGTAGAT TCCTTCATGC ATAGCACCTC ATTCTTGGGG ACAAAAGCAC GGTTTGGCCG 762 0 

TTCCATTGCT GC AC GAACGA GCTTTGCTAT ATCCTCGGGT TGGATCATCT CATCAGGTCC 7680 

AATCAAATTT GTCCAAGAAC TCATGTTAGT CGCAACGAAA C CGGGGC AT A TGGTGCACTC 7740 

TCAGTACAAT CTGCTCTGAT GCCGCATAGT TAAGCCAGCC CCGACACCCG CCAACACCCG 7800 

CTGACGCGCC CTGACGGGCT TGTCTGCTCC CGGCATCCGC TTACAGACAA GCTGTGACCG 7860 

TCTCCGGGAG CTGCATGTGT CAGAGGTTTT CACCGTCATC ACCGAAACGC GCGAGACGAA 792 0 

AGGGCCTCGT GATACGCCTA TTTTTATAGG TTAATGTCAT GATAATAATG GTTTCTTAGA 7980 
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CGTCAGGTGG CACTTTTCGG GGAAATGTGC GCGGAACCCC TATTTGTTTA TTTTTCTAAA 8040 

TACATTCAAA TATGTATCCG CTCATGAGAC AATAACCCTG ATAAATGCTT CAATAATATT 8100 

GAAAAAGGAA GAGTATGAGT ATTCAACATT TCCGTGTCGC CCTTATTCCC TTTTTTGCGG 8160 

CATTTTGCCT TCCTGTTTTT GCTCACCCAG AAACGCTGGT GAAAGTAAAA GATGCTGAAG 82 2 0 

ATCAGTTGGG TGCACGAGTG GGTTACATCG AACTGGATCT CAACAGCGGT AAGATCCTTG 82 80 

AGAGTTTTCG CCCCGAAGAA CGTTTTCCAA TGATGAGCAC TTTTAAAGTT CTGCTATGTG 8340 

GCGCGGTATT ATCCCGTATT GACGCCGGGC AAGAGCAACT CGGTCGCCGC ATACACTATT 8400 

CTCAGAATGA CTTGGTTGAG TACTCACCAG TCACAGAAAA GCATCTTACG GATGGCATGA 846 0 

CAGTAAGAGA ATTATGCAGT GCTGCCATAA CCATGAGTGA TAACACTGCG GCCAACTTAC 852 0 

TTCTGACAAC GATCGGAGGA CCGAAGGAGC TAACCGCTTT TTTGCACAAC ATGGGGGATC 8580 

ATGTAACTCG CCTTGATCGT TGGGAACCGG AGCTGAATGA AGCCATACCA AACGACGAGC 8640 

GTGACACCAC GATGCCTGTA GCAATGGCAA CAACGTTGCG CAAACTATTA ACTGGCGAAC 8700 

TACTTACTCT AGCTTCCCGG CAACAATTAA TAGAC TGGAT GGAGGCGGAT AAAGTTGCAG 87 60 

GACCACTTCT GCGCTCGGCC CTTCCGGCTG GC TGGTTT AT TGCTGATAAA TCTGGAGCCG 8820 

i 

GTGAGC GTGG GTCTCGCGGT ATCATTGCAG CACTGGGGCC AGATGGTAAG CCCTCCCGTA 8880 

TCGTAGTTAT CTACACGACG GGGAGTCAGG CAACTATGGA TGAACGAAAT AGACAGATCG 8940 

CTGAGATAGG TGCCTCACTG ATTAAGCATT GGTAACTGTC AGACCAAGTT TACTCATATA 9000 

TACTTTAGAT TGATTTAAAA CTTCATTTTT AATTTAAAAG GATCTAGGTG AAGATCCTTT 9060 

TTGATAATCT CATGACCAAA ATCCCTTAAC GTGAGTTTTC GTTCCACTGA GCGTCAGACC 9120 

CCGTAGAAAA GATCAAAGGA TCTTCTTGAG ATCCTTTTTT TCTGCGCGTA ATCTGCTGCT 9180 

TGCAAACAAA AAAACCACCG CTACCAGCGG TGGTTTGTTT GCCGGATCAA GAGCTACCAA 9240 

CTCTTTTTCC GAAGGTAACT GGCTTCAGCA GAGCGCAGAT ACCAAATACT GTCCTTCTAG 9300 

TGTAGCCGTA GTTAGGCCAC CACTTCAAGA ACTCTGTAGC ACCGCCTACA TACCTCGCTC 93 60 

TGCTAATCCT GTTACCAGTG GCTGCTGCCA GTGGCGATAA GTCGTGTCTT ACCGGGTTGG 9420 

AC TC AAGACG ATAGTTACCG GATAAGGCGC AGCGGTCGGG CTGAACGGGG GGTTCGTGCA 9480 

CACAGCCCAG CTTGGAGCGA ACGACCTACA CCGAACTGAG ATACCTACAG CGTGAGCATT 9540 

GAGAAAGCGC CACGCTTCCC GAAGGGAGAA AGGCGGACAG GTATCCGGTA AGCGGCAGGG 9600 

TCGGAACAGG AGAGCGCACG AGGGAGCTTC CAGGGGGAAA CGCCTGGTAT CTTTATAGTC 9660 
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CTGTCGGGTT TCGCCACCTC TGACTTGAGC GTC GATTTTT GTGATGCTCG TCAGGGGGGC 9720 

GGAGCCTATG GAAAAACGCC AGCAACGCGG CCTTTTTACG GTTCCTGGCC TTTTGCTGGC 9780 

CTTTTGCTCA CATGTTCTTT CCTGCGTTAT CCCCTGATTC TGTGGATAAC CGTATTACCG 9840 

CCTTTGAGTG AGCTGATACC GCTCGCCGCA GCCGAACGAC CGAGCGCAGC GAGTCAGTGA 9900 

GCGAGGAAGC GGAAGAGCGC CCAATACGCA AACCGCCTCT CCCCGCGCGT TGGCCGATTC 9960 

ATTAATGCAG CTGGCACGAC AGGTTTCCCG ACTGGAAAGC GGGCAGTGAG CGCAACGCAA 10020 

TTAATGTGAG TTAGCTCACT CATTAGGCAC CCCAGGCTTT ACACTTTATG CTTCCGGCTC 10080 

GTATGTTGTG TGGAATTGTG AGCGGATAAC AATTTC AC AC AGGAAACAGC TATGACCATG 10140 

ATTACGCCAA GCTTCCGCGG 10160 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GGGCCCACCA CTGTTGTAAC TTGTAAGCCA CTAGCTCACG TTCTCCATGA GCTCTTCTCT 60 

CTGCTGTTTC TTCCTCTGCT AACTGCGTTA TGATATGACG TCGTATAAAT AATCTCACAA 120 

TACTTCCTTA TTTTCAGCAT GGCCTCTTTT ATGTTTATTT AACAGTAGCA ACCAACGCCG 180 

CTCGATGTTT CCTTCAAGAA ACGGCCACTC ACTATGTGGT GTGCAGAAGA ACAAATGTAA 240 

GCAGCTCCTA CAGGTACCAG TAGTCATGTC AGTGTGGAAG CTTTCCAACC AACGCCTCCT 300 

TCGAGGAACC TGGTCGTGCT GACATGAATG TAGGCCATGC AAGCACAAGC ACCTAACGCG 3 60 

AATCATCACG ACGCGCCGTG TACTGGGCGT TGGTACATCA CACCCCGCGT TTGACCTGAT 42 0 

CGGAAGCATG CGTGTGTGTT GGCTGCAGGA CCGGCTATAG GTTTCCTGCA TTGGACAGCA 480 

GAAGCCAGTC ATGTTAGGCA CTCACGCGCT CCTGCCGTTT GATGAATCAT CCGGTCTTTC 540 

GTATTGATCA CTAGTTCACT ACGCTGATAT AGCAAATTTT AAGATGTGAA ACCACGAGAC 600 

GAGCGATAAA TCTTAGACGT TACCTATCCA TATGAAGCTT GTGCGAAAAA AAGGCGTGCC 660 

GCTGTAGCAT CATTCGTATA CACTTTTGTC CCCAAAGACA GGGATACGAA TCCATGCTCG 720 

ACAGAACCCT CCCTTCCCTG CAGATAACGA CACTTAAGTA TAACAAAAGT AGTTGGATTA 780 

TTTCAGAAGC AAAATCTCAC TTTTCGCTGG CCTTTTTGTA CTTTGGTTAC TTGAGTTCAG 840 
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ACAGTGTATG CTATATTGTC ATGTGCTGCG TAAGGTTTAA ATATGGTTCG ACAAATATAT 900 

CAGTATATCA CTACTTTGTT ATGGGTGGGG CCTAGCACAA ACTTGATACA GCTAGGATAA 960 

AGTTAGAACG ATGACTGATC TACTGTAAAG CGACACCTGT CCTGTTATGG TAGTTTAAGT 1020 

CCATTCCTGG ACGACTCCAG ATCCAGGATA TGATGCTGTT ACATAATGCG ATTGTTCACA 1080 

ATAAAATTGC ATGATGTTCT TCTACTCTTT AGGCAGTTTT GTTCAACAGG CAAGTTGCAT 1140 

AATGCATGTG CATATATGAG CAGCATAATC ATCAATTAAT CATAGGTTCG TCATTTTAGT 1200 

TTCACTCCTT CACATTATTC CAGCCCTTGA AGAAAAATGT AGCAGTGCTT GCTGTTTAAT 12 60 

AAGTGGCAGA GCTGTTTTCA CTCCACCTAC GCTTGTCTAG GACCAAAATT TTAATCTGTC 132 0 

ACTTTGAGCT AAAACTGAAG CACCAAACCG C T AC AAAAGA ACGTAGGAGC TGAATTGTAA 1380 

CTTGATGGGA TTACTATAGC AGTTGCTACA GTTCTAGCTA GCTACCTTAT TCTATACGCA 1440 

TCACCCTAAC AACCCGGCTG ACTGCTGCAT CTGACCCCAC CGTCCCCTGC TCCAAACCAA 1500 

CTCTCCTTTC CTTGCATGCA CTACACCCAC TTCCTGCAGC TATATATACC ACCATATGCC 15 60 

CATCTTATGA AACCATCCAC AAGAGGAGAA GAAACAATCA ACCAGCAACA CTCTTCTCTT 1620 

ATAACATAGT ACAGCGAAGG TAACTCACGT CGACCATGGT CCGTCCTGTA GAAACCCCAA 1680 

CCCGTGAAAT CAAAAAACTC GACGGCC TGT GGGCATTCAG TCTGGATCGC GAAAACTGTG 1740 

GAATTG ATC A GCGTTGGTGG GAAAGCGCGT TACAAGAAAG CCGGGCAATT GCTGTGCCAG 1800 

GCAGTTTTAA CGATCAGTTC GCCGATGCAG ATATTCGTAA TTATGCGGGC AACGTCTGGT 1860 

ATCAGCGCGA AGTCTTTATA CCGAAAGGTT GGGCAGGCCA GCGTATCGTG CTGCGTTTCG 1920 

ATGCGGTCAC TCATTACGGC AAAGTGTGGG TCAATAATCA GGAAGTGATG GAGCATCAGG 1980 

GCGGCTATAC GCCATTTGAA GCCGATGTCA CGCCGTATGT TATTGCCGGG AAAAGTGTAC 2040 

GTATCACCGT TTGTGTGAAC AACGAACTGA ACTGGCAGAC TATCCCGCCG GGAATGGTGA 2100 

TTACCGACGA AAACGGCAAG AAAAAGCAGT CTTACTTCCA TGATTTCTTT AACTATGCCG 2160 

GAATCCATCG CAGCGTAATG CTCTACACCA CGCCGAACAC CTGGGTGGAC GATATCACCG 2220 

TGGTGACGCA TGTCGCGCAA GACTGTAACC ACGCGTCTGT TGACTGGCAG GTGGTGGCCA 2280 

ATGGTGATGT CAGCGTTGAA CTGCGTGATG CGGATCAACA GGTGGTTGCA ACTGGACAAG 2340 

GCACTAGCGG GACTTTGCAA GTGGTGAATC CGCACCTCTG GCAACCGGGT GAAGGTTATC 2400 

TCTATGAACT GTGCGTCACA GCCAAAAGCC AGACAGAGTG TGATATCTAC CCGCTTCGCG 2460 

TCGGCATCCG GTCAGTGGCA GTGAAGGGCG AACAGTTCCT GATTAACCAC AAACCGTTCT 2520 
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ACTTTACTGG CTTTGGTCGT CATGAAGATG CGGACTTACG TGGCAAAGGA TTCGATAACG 258 0 

TGCTGATGGT GCACGACCAC GCATTAATGG ACTGGATTGG GGCCAACTCC TACCGTACCT 2640 

CGCATTACCC TTACGCTGAA GAGATGCTCG ACTGGGCAGA TGAACATGGC ATCGTGGTGA 27 00 

TTGATGAAAC TGCTGCTGTC GGCTTTAACC TCTCTTTAGG CATTGGTTTC GAAGCGGGCA 2760 

ACAAGCCGAA AGAACTGTAC AGCGAAGAGG CAGTCAACGG GGAAAC TC AG CAAGCGCACT 282 0 

TACAGGCGAT TAAAGAGCTG ATAGCGCGTG ACAAAAACCA CCCAAGCGTG GTGATGTGGA 2880 

GTATTGCCAA CGAACCGGAT ACCCGTCCGC AAGTGCACGG GAATATTTCG CCACTGGCGG 2940 

AAGCAACGCG TAAACTCGAC CCGACGCGTC CGATCACCTG CGTCAATGTA ATGTTCTGCG 3000 

ACGCTCACAC CGATACCATC AGCGATCTCT TTGATGTGCT GTGCCTGAAC CGTTATTACG 3060 

GATGGTATGT CCAAAGCGGC GATTTGGAAA CGGCAGAGAA GGTACTGGAA AAAGAACTTC 3120 

TGGCCTGGCA GGAGAAACTG CATCAGCCGA TTATCATCAC CGAATACGGC GTGGATACGT 3180 

TAGCCGGGCT GCACTCAATG TACACCGACA TGTGGAGTGA AGAGTATCAG TGTGCATGGC 3240 

TGGATATGTA TCACCGCGTC TTTGATCGCG TCAGCGCCGT CGTCGGTGAA CAGGTATGGA 3300 

ATTTCGCCGA TTTTGCGACC TCGCAAGGCA TATTGCGCGT TGGCGGTAAC AAGAAAGGGA 3360 

TCTTCACTCG CGACCGCAAA CCGAAGTCGG CGGCTTTTCT GCTGCAAAAA CGCTGGACTG 3420 

GCATGAACTT CGGTGAAAAA CCGCAGCAGG GAGGCAAACA ATGAATCAAC AACTCTCCTG 3480 

GCGCACCATC GTCGGCTACA GCCTCGGTGG GGAATTGGAG CTCGAATTTC CCCGATCGTT 3540 

CAAACATTTG GCAATAAAGT TTCTTAAGAT TGAATCCTGT TGCCGGTCTT GCGATGATTA 3 600 

TCATATAATT TCTGTTGAAT TACGTTAAGC ATGTAATAAT TAACATGTAA TGCATGACGT 3660 

TATTTATGAG ATGGGTTTTT ATGATTAGAG TCCCGCAATT ATACATTTAA T ACGC GAT AG 3720 

AAAACAAAAT AT AGCGC GC A AACTAGGATA AATTATCGCG CGCGGTGTCA TCTATGTTAC 3780 

TAGATCGATC GGGAATTAAG CTTAGATCTG CATGGGTGGA GACTTTTCAA CAAAGGGTAA 3840 

TATCCGGAAA CCTCCTCGGA TTCCATTGCC CAGCTATCTG TCACTTTATT GTGAAGATAG 3900 

TGGAAAAGGA AGGTGGCTCC TACAAATGCC ATCATTGCGA TAAAGGAAAG GCCATCGTTG 3960 

AAGATGCCTC TGCCGACAGT GGTCCCAAAG ATGGACCCCC ACCCACGAGG AGCATCGTGG 4020 

AAAAAGAAGA CGTTCCAACC ACGTCTTCAA AGCAAGTGGA TTGATGTGAT CATCGATGGA 4080 

GACTTTTCAA CAAAGGGTAA TATCCGGAAA CCTCCTCGGA TTCCATTGCC CAGCTATCTG 4140 

TCACTTTATT GTGAAGATAG TGGAAAAGGA AGGTGGCTCC TACAAATGCC ATCATTGCGA 4200 

TAAAGGAAAG GCCATCGTTG AAGATGCCTC TGCCGACAGT GGTCCCAAAG ATGGACCCCC 42 60 
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ACCCACGAGG AGCATCGTGG AAAAAGAAGA CGTTCCAACC ACGTCTTCAA AGCAAGTGGA 432 0 

TTGATGTGAT ATCTCCACTG AC GT AAGGGA TGACGCACAA TCCCACTATC CTTCGCAAGA 4380 

CCCTTCCTCT ATATAAGGAA GTTCATTTCA TTTGGAGAGA ACACGGGGGA CTCTAGAGGA 4440 

TCCAGCTGAA GGCTCGACAA GGCAGTCCAC GGAGGAGCTG ATATTTGGTG GACAAGCTGT 4500 

GGATAGGAGC AACCCTATCC CTAATATACC AGCACCACCA AGTCAGGGCA ATCCCCAGAT 4560 

CAAGTGCAAA GGTCCGCCTT GTTTCTCCTC TGTCTCTTGA TCTGACTAAT CTTGGTTTAT 462 0 

GATTCGTTGA GTAATTTTGG GGAAAGCTCC TTTGCTGCTC CACACATGTC CATTCGAATT 4680 

TTACCGTGTT TAGCAAGGGC GAAAAGTTTG CATCTTGATG ATTTAGCTTG ACTATGCGAT 4740 

TGCTTTCCTG GACCCGTGCA GCTGCGCTCG GATCTGGGGC CATTTGTTCC AGGCACGGGA 4800 

TAAGC ATTC A GCCATGGCAG ACGCCAAAAA CATAAAGAAA GGCCCGGCGC CATTCTATCC 4860 

TCTAGAGGAT GGAACCGCTG GAGAGCAACT GCATAAGGCT ATGAAGAGAT ACGCCCTGGT 4920 

TCCTGGAACA ATTGCTTTTA CAGATGCACA TATCGAGGTG AACATCACGT ACGCGGAATA 4980 

CTTCGAAATG TCCGTTCGGT TGGCAGAAGC TATGAAACGA TATGGGCTGA ATACAAATCA 5040 

CAGAATCGTC GTATGCAGTG AAAACTCTCT TCAATTCTTT ATGCCGGTGT TGGGCGCGTT 5100 

ATTTATCGGA GTTGCAGTTG CGCCCGCGAA CGACATTTAT AATGAACGTG AATTGCTCAA 5160 

CAGTATGAAC ATTTCGCAGC CTACCGTAGT GTTTGTTTCC AAAAAGGGGT TGCAAAAAAT 5220 

TTTGAACGTG CAAAAAAAAT TACCAATAAT CCAGAAAATT ATTATCATGG ATTCTAAAAC 5280 

GGATTACCAG GGATTTCAGT CGATGTACAC GTTCGTCACA TCTCATCTAC CTCCCGGTTT 5340 

TAATGAATAC GATTTTGTAC CAGAGTCCTT TGATCGTGAC AAAACAATTG CACTGATAAT 5400 

GAATTCCTCT GGATCTACTG GGTTACCTAA GGGTGTGGCC CTTCCGCATA GAACTGCCTG 5460 

CGTCAGATTC TCGCATGCCA GAGATCCTAT TTTTGGCAAT CAAATCATTC CGGATACTGC 5520 

GATTTTAAGT GTTGTTCCAT TCCATCACGG TTTTGGAATG TTTACTACAC TCGGATATTT 5580 

GATATGTGGA TTTCGAGTCG TCTTAATGTA TAGATTTGAA GAAGAGCTGT TTTTACGATC 5640 

CCTTCAGGAT TACAAAATTC AAAGTGC GTT GCTAGTACCA ACCCTATTTT CATTCTTCGC 57 00 

CAAAAGCACT CTGATTGACA AATACGATTT ATCTAATTTA CACGAAATTG CTTCTGGGGG 57 60 

CGCACCTCTT TCGAAAGAAG TCGGGGAAGC GGTTGCAAAA CGCTTCCATC TTCCAGGGAT 5820 

ACGACAAGGA TATGGGCTCA CTGAGACTAC ATCAGCTATT CTGATTACAC CCGAGGGGGA 5880 

TGATAAACCG GGCGCGGTCG GTAAAGTTGT TCCATTTTTT GAAGCGAAGG TTGTGGATCT 5940 
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GGATACCGGG AAAACGCTGG GCGTTAATCA GAGAGGCGAA TTATGTGTCA GAGGACCTAT 6000 

GATTATGTCC GGTTATGTAA ACAATCCGGA AGCGACCAAC GCCTTGATTG ACAAGGATGG 6060 

ATGGC TAC AT TCTGGAGACA TAGCTTACTG GGACGAAGAC GAACACTTCT TCATAGTTGA 6120 

CCGCTTGAAG TCTTTAATTA AATACAAAGG ATATCAGGTG GCCCCCGCTG AATTGGAATC 6180 

GATATTGTTA CAACACCCCA ACATCTTCGA CGCGGGCGTG GCAGGTCTTC CCGACGATGA 6240 

CGCCGGTGAA CTTCCCGCCG CCGTTGTTGT TTTGGAGCAC GGAAAGACGA TGAC GGAAAA 63 00 

AGAGATCGTG GATTACGTCG CCAGTCAAGT AACAACCGCG AAAAAGTTGC GCGGAGGAGT 63 60 

TGTGTTTGTG GACGAAGTAC CGAAAGGTCT TACCGGAAAA CTCGACGCAA GAAAAATCAG 642 0 

AGAGATCCTC ATAAAGGCCA AGAAGGGCGG AAAGTCCAAA TTGTAAAATG TAACTGTATT 6480 

CAGCGATGAC GAAATTCTTA GCTATTGTAA TCAGATCCGC GAATTTCCCC GATCGTTCAA 6540 

ACATTTGGCA ATAAAGTTTC TTAAGATTGA ATCCTGTTGC CGGTCTTGCG ATGATTATCA 66 00 

TATAATTTCT GTTGAATTAC GTTAAGCATG TAATAATTAA CATGTAATGC ATGACGTTAT 6660 

TTATGAGATG GGTTTTTATG ATTAGAGTCC CGCAATTATA CATTTAATAC GC GAT AG AAA 672 0 

ACAAAATATA GCGCGCAAAC TAGGATAAAT TATCGCGCGC GGTGTCATCT ATGTTACTAG 67 80 

ATCGATCGGG AATTGAGATC TCATATGTCG AGC TCGGGGA TCTCCTTTGC CCCAGAGATC 6840 

ACAATGGACG ACTTCCTCTA TCTCTACGAT CTAGTCAGGA AGTTCGACGG AGAAGGTGAC 69 00 

GATACCATGT TCACCAC TGA TAATGAGAAG ATTAGCCTTT TCAATTTCAG AAAGAATGCT 69 60 

AACCCACAGA TGGTTAGAGA GGCTT AC GC A GCAGGTCTCA TCAAGACGAT CTACCCGAGC 7020 

AATAATCTCC AGGAGATCAA ATACCTTCCC AAGAAGGTTA AAGATGCAGT CAAAAGATTC 7080 

AGGACTAACT GCATCAAGAA CACAGAGAAA GATATATTTC TCAAGATCAG AAGTACTATT 7140 

CCAGTATGGA CGATTCAAGG CTTGCTTCAC AAACCAAGGC AAGTAATAGA GATTGGAGTC 72 00 

TCTAAAAAGG TAGTTCCCAC TGAATCAAAG GCCATGGAGT CAAAGATTCA AATAGAGGAC 72 60 

CTAACAGAAC TCGCCGTAAA GACTGGCGAA CAGTTCCATC GATGATTGAG ACTTTTCAAC 7320 

AAAGGGTAAT ATCCGGAAAC CTCCTCGGAT TCCATTGCCC AGCTATCTGT CACTTTATTG 7380 

TGAAGATAGT GGAAAAGGAA GGTGGCTCCT ACAAATGCCA TCATTGCGAT AAAGGAAAGG 7440 

CCATCGTTGA AGATGCCTCT GCCGACAGTG GTCCCAAAGA TGGACCCCCA CCCACGAGGA 7500 

GCATCGTGGA AAAAGAAGAC GTTCCAACCA CGTCTTCAAA GCAAGTGGAT TGATGTGATA 7560 

TCTCCACTGA CGTAAGGGAT GACGCACAAT CCCACTATCC TTCGCAAGAC CCTTCCTCTA 762 0 

TATAAGGAAG TTCATTTCAT TTGGAGAGGA CACGCTGACA AGCTCGGATC CTTTAGCATG 7680 
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ATTGAACAAG ATGGATTGCA CGCAGGTTCT CCGGCCGCTT GGGTGGAGAG GCTATTCGGC 7740 

TATGAC TGGG CACAACAGAC AATCGGCTGC TCTGATGCCG CCGTGTTCCG GCTGTCAGCG 7800 

CAGGGGCGCC CGGTTCTTTT TGTCAAGACC GACCTGTCCG GTGCCCTGAA TGAACTGCAG 7860 

GAC GAGGC AG CGCGGCTATC GTGGCTGGCC AC GAC GGGCG TTCCTTGCGC AGCTGTGCTC 792 0 

GACGTTGTCA CTGAAGCGGG AAGGGACTGG CTGCTATTGG GCGAAGTGCC GGGGCAGGAT 7980 

CTCCTGTCAT CTCACCTTGC TCCTGCCGAG AAAGTATCCA TCATGGCTGA TGCAATGCGG 8040 

CGGCTGCATA CGCTTGATCC GGCTACCTGC CCATTCGACC ACCAAGCGAA ACATCGCATC 8100 

GAGCGAGCAC GTACTCGGAT GGAAGCC GGT CTTGTCGATC AGGATGATCT GGACGAAGAG 8160 

CATCAGGGGC TCGCGCCAGC CGAACTGTTC GCCAGGCTCA AGGCGCGCAT GCCCGACGGC 8220 

GAGGATCTCG TCGTGACCCA TGGCGATGCC TGCTTGCCGA ATATCATGGT GGAAAATGGC 828 0 

CGCTTTTCTG GATTCATCGA CTGTGGCCGG CTGGGTGTGG CGGACCGCTA TCAGGACATA 8340 

GCGTTGGCTA CCCGTGATAT TGCTGAAGAG CTTGGCGGCG AATGGGC TG A CCGCTTCCTC 8400 

GTGCTTTACG GTATCGCCGC TCCCGATTCG CAGCGCATCG CCTTCTATCG CCTTCTTGAC 8460 

GAGTTCTTCT GAGCGGGACT CTGGGGTTCG AAATGACCGA CCAAGCGACG CCCAACCTGC 8520 

r 

CATCACGAGA TTTCGATTCC ACCGCCGCCT TCTATGAAAG GTTGGGCTTC GGAATCGTTT 858 0 

TCCGGGACGC CGGCTGGATG ATCCTCCAGC GCGGGGATCT CATGCTGGAG TTCTTCGCCC 8640 

ACCCCAACAG AGGTGGATGG ACAGACCCGT TCTTACACCG GACTGGGCGC GGGATAGGAT 8700 

ATTCAGATTG GGATGGGATT GAGCTTAAAG CCGGCGCTGA GACCATGCTC AAGGTAGGCA 8760 

ATGTC CTC AG CGTCGAGCCC GGCATCTATG TCGAGGGCAT TGGTGGAGCG CGCTTCGGGG 8820 

ATACCGTGCT TGTAACTGAG ACCGGATATG AGGCCCTCAC TCCGCTTGAT CTTGGCAAAG 8880 

ATATTTGACG CATTTATTAG TATGTGTTAA TTTTCATTTG CAGTGCAGTA TTTTCTATTC 8940 

GATCTTTATG TAATTCGTTA CAATTAATAA ATATTCAAAT CAGATTATTG ACTGTCATTT 9000 

GTATCAAATC GTGTTTAATG GATATTTTTA TTATAATATT ^GATGATATCT CAATCAAAAC 9060 

GTAGATAATA ATAATATTTA TTTAATATTT TTGCGTCGCA CAGTGAAAAT CTATATGAGA 9120 

TTACAAAATA CCGACAACAT TATTTAAGAT ACATAGACAT TAAC CCTGAG ACTGTTGGAC 9180 

ATCAACGGGT AGATTCCTTC ATGCATAGCA CCTCATTCTT GGGGACAAAA GCACGGTTTG 9240 

GCCGTTCCAT TGCTGCACGA ACGAGCTTTG CTATATCCTC GGGTTGGATC ATCTCATCAG 9300 

GTCCAATCAA ATTTGTCCAA GAACTCATGT TAGTCGCAAC GAAACC GGGG CATATGGTGC 9360 
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ACTCTCAGTA CAATCTGCTC TGATGCCGCA TAGTTAAGCC AGCCCCGACA CCCGCCAACA 942 0 

CCCGCTGACG CGCCCTGACG GGCTTGTCTG CTCCCGGCAT CCGCTTACAG ACAAGCTGTG 9480 

ACCGTCTCCG GGAGCTGCAT GTGTCAGAGG TTTTCACCGT CATCACCGAA ACGCGCGAGA 9540 

CGAAAGGGCC TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT 9 600 

TAGAC GTC AG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC 9660 

TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA 972 0 

TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT 9780 

GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT 9840 

GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATC GAACTGG ATCTCAACAG CGGTAAGATC 9900 

CTTGAGAGTT TTCGCCCCGA AGAAC GTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA 9960 

TGTGGCGCGG TATTATCCCG TATTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC 10020 

TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC 10080 

ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC 10140 

TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG 10200 

GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGC CAT ACCAAACGAC 10260 

GAGCGTGACA CCACGATGCC TGTAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC 10320 

GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT 10380 

GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA 10440 

GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC 10500 

CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG 10560 

ATC GCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA 10620 

TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC 10680 

CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA 10740 

GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC 10800 

TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA 10860 

CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT 10920 

CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC 10980 

GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG 11040 

TTGGACTCAA G AC GAT AGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG 11100 
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TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG 11160 

CATTGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC 1122 0 

AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT 11280 

AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG 11340 

GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC 11400 

TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT 11460 

ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA 11520 

GTGAGCGAGG AAGCGGAAGA GCGCCCAATA CGCAAACCGC CTCTCCCCGC GCGTTGGCCG 11580 

ATTCATTAAT GCAGCTGGCA CGACAGGTTT CCCGACTGGA AAGC GGGC AG TGAGCGCAAC 11640 

GCAATTAATG TGAGTTAGCT CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG 11700 

GCTCGTATGT TGTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC 11760 

CATGATTACG CCAAGCTTCC GCGG 11784 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11991 base pairs 1 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGGCCCACCA CTGTTGTAAC TTGTAAGCCA CTAGCTCACG TTCTCCATGA GCTCTTCTCT 60 

CTGCTGTTTC TTCCTCTGCT AACTGCGTTA TGATATGACG TCGTATAAAT AATCTCACAA 120 

TACTTCCTTA TTTTCAGCAT GGCCTCTTTT ATGTTTATTT AACAGTAGCA ACCAACGCCG 180 

CTCGATGTTT CCTTCAAGAA ACGGCCACTC ACTATGTGGT GTGCAGAAGA ACAAATGTAA 240 

GCAGCTCCTA C AGGT AC C AG TAGTCATGTC AGTGTGGAAG CTTTCCAACC AACGCCTCCT 300 

TCGAGGAACC TGGTCGTGCT GACATGAATG TAGGCCATGC AAGCACAAGC ACCTAACGCG 3 60 

AATCATCACG ACGCGCCGTG TACTGGGCGT TGGTACATCA CACCCCGCGT TTGACCTGAT 420 

CGGAAGCATG CGTGTGTGTT GGCTGCAGGA CCGGCTATAG GTTTCCTGCA TTGGACAGCA 480 

GAAGCCAGTC ATGTTAGGCA CTCACGCGCT CCTGCCGTTT GATGAATCAT CCGGTCTTTC 540 

GTATTGATCA CTAGTTCACT ACGCTGATAT AGCAAATTTT AAGATGTGAA ACCACGAGAC 600 
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GAGCGATAAA TCTTAGACGT TACCTATCCA TATGAAGCTT GTGCGAAAAA AAGGCGTGCC 660 

GCTGTAGCAT CATTCGTATA CACTTTTGTC CCCAAAGACA GGGATACGAA TCCATGCTCG 72 0 

ACAGAACCCT CCCTTCCCTG CAGATAACGA CACTTAAGTA TAACAAAAGT AGTTGGATTA 780 

TTTCAGAAGC AAAATCTCAC TTTTCGCTGG CCTTTTTGTA CTTTGGTTAC TTGAGTTCAG 840 

ACAGTGTATG CTATATTGTC ATGTGCTGCG TAAGGTTTAA AT ATGGTTC G ACAAATATAT 900 

CAGTATATCA CTACTTTGTT ATGGGTGGGG CCTAGCACAA ACTTGATACA GCTAGGATAA 9 60 

AGTTAGAACG ATGACTGATC TACTGTAAAG CGACACCTGT CCTGTTATGG TAGTTTAAGT 1020 

CCATTCCTGG ACGACTCCAG ATCCAGGATA TGATGCTGTT ACATAATGCG ATTGTTCACA 1080 

ATAAAATTGC ATGATGTTCT TCTACTCTTT AGGCAGTTTT GTTCAACAGG CAAGTTGCAT 1140 

AATGCATGTG CATATATGAG CAGCATAATC ATCAATTAAT C AT AGGTTC G TCATTTTAGT 12 00 

TTCACTCCTT CACATTATTC CAGCCCTTGA AGAAAAATGT AGCAGTGCTT GCTGTTTAAT 12 60 

AAGTGGCAGA GCTGTTTTCA CTCCACCTAC GCTTGTCTAG G AC C AAAATT TTAATCTGTC 1320 

ACTTTGAGCT AAAACTGAAG CACCAAACCG CTACAAAAGA ACGTAGGAGC TGAATTGTAA 1380 

CTTGATGGGA TTACTATAGC AGTTGCTACA GTTCTAGCTA GCTACCTTAT TCTATACGCA 1440 

TCACCCTAAC AACCCGGCTG ACTGCTGCAT CTGACCCCACT CGTCCCCTGC TCCAAACCAA 1500 

CTCTCCTTTC CTTGCATGCA CTACACCCAC TTCCTGCAGC TAT AT AT AC C ACCATATGCC 1560 

CATCTTATGA AACCATCCAC AAGAGGAGAA GAAACAATCA ACCAGCAACA CTCTTCTCTT 1620 

ATAACATAGT ACAGCGAAGG TAACTCACAG TGCAAAGGTC CGCCTTGTTT CTCCTCTGTC 1680 

TCTTGATCTG ACT AATC TTG GTTTATGATT CGTTGAGTAA TTTTGGGGAA AGCTCCTTTG 1740 

CTGCTCCACA CATGTCCATT C G AATTTT AC CGTGTTTAGC AAGGGCGAAA AGTTTGCATC 1800 

TTGATGATTT AGCTTGACTA TGCGATTGCT TTCCTGGACC CGTGCAGCTG CGCTCGTCGA 1860 

CCATGGTCCG TCCTGTAGAA ACCCCAACCC GTGAAATCAA AAAACTC GAC GGCCTGTGGG 1920 

CATTCAGTCT GGATCGCGAA AACTGTGGAA TTGATCAGCG TTGGTGGGAA AGCGCGTTAC 1980 

AAGAAAGCCG GGCAATTGCT GTGCCAGGCA GTTTTAACGA TCAGTTCGCC GATGCAGATA 2040 

TTCGTAATTA TGCGGGCAAC GTCTGGTATC AGCGCGAAGT CTTTATACCG AAAGGTTGGG 2100 

CAGGCCAGCG TATCGTGCTG CGTTTCGATG CGGTCACTCA TTACGGCAAA GTGTGGGTCA 2160 

ATAATCAGGA AGTGATGGAG CATCAGGGCG GCTATACGCC ATTTGAAGCC GATGTCACGC 2220 

CGTATGTTAT TGCCGGGAAA AGTGTACGTA TCACCGTTTG TGTGAACAAC GAACTGAACT 2280 

GGCAGACTAT CCCGCCGGGA ATGGTGATTA CCGACGAAAA CGGCAAGAAA AAGCAGTCTT 2340 
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ACTTCCATGA TTTCTTTAAC TATGCCGGAA TCCATCGCAG CGTAATGCTC TACACCACGC 2400 

CGAACACCTG GGTGGAC GAT ATCACCGTGG TGACGCATGT CGCGCAAGAC TGTAACCACG 24 60 

CGTCTGTTGA CTGGCAGGTG GTGGCCAATG GTGATGTCAG CGTTGAACTG CGTGATGCGG 252 0 

ATCAACAGGT GGTTGCAACT GGACAAGGCA CTAGCGGGAC TTTGCAAGTG GTGAATCCGC 2580 

ACCTCTGGCA ACCGGGTGAA GGTTATCTCT ATGAACTGTG CGTCACAGCC AAAAGCCAGA 2 640 

CAGAGTGTGA TATCTACCCG CTTCGCGTCG GCATCCGGTC AGTGGCAGTG AAGGGCGAAC 2700 

AGTTCCTGAT TAACCACAAA CCGTTCTACT TTACTGGCTT TGGTCGTCAT GAAGATGCGG 27 6 0 

ACTTACGTGG CAAAGGATTC GATAACGTGC TGATGGTGCA CGACCACGCA TTAATGGACT 282 0 

GGATTGGGGC CAACTCCTAC CGTACCTCGC ATTACCCTTA CGCTGAAGAG ATGCTCGACT 288 0 

GGGCAGATGA ACATGGCATC GTGGTGATTG ATGAAACTGC TGCTGTCGGC TTTAACCTCT 2940 

CTTTAGGCAT TGGTTTCGAA GCGGGCAACA AGC CGAAAGA ACTGTACAGC GAAGAGGCAG 3 000 

TCAACGGGGA AACTCAGCAA GCGCACTTAC AGGCGATTAA AGAGCTGATA GCGCGTGACA 3060 

AAAACCACCC AAGCGTGGTG ATGTGGAGTA TTGCCAACGA ACCGGATACC CGTCCGCAAG 312 0 

TGC AC GGGAA TATTTCGCCA CTGGCGGAAG CAACGCGTAA ACTCGACCCG ACGCGTCCGA 3180 

TCACCTGCGT CAATGTAATG TTCTGCGACG CTCACACCGA TACCATCAGC GATCTCTTTG 3240 

ATGTGCTGTG CCTGAACCGT TATTACGGAT GGTATGTCCA AAGCGGCGAT TTGGAAACGG 3300 

CAGAGAAGGT ACTGGAAAAA GAACTTCTGG CCTGGCAGGA GAAACTGCAT CAGCCGATTA 33 60 

TCATCACCGA ATACGGCGTG GAT AC GTT AG CCGGGCTGCA CTCAATGTAC ACCGACATGT 342 0 

GGAGTGAAGA GTATCAGTGT GCATGGCTGG ATATGTATCA CCGCGTCTTT GATCGCGTCA 3480 

GCGCCGTCGT C GGTGAAC AG GTATGGAATT TCGCCGATTT TGCGACCTCG CAAGGCATAT 3540 

TGCGCGTTGG CGGTAACAAG AAAGGGATCT TCACTCGCGA CCGCAAACCG AAGTCGGCGG 3600 

CTTTTCTGCT GCAAAAACGC TGGACTGGCA TGAACTTCGG TGAAAAACCG CAGCAGGGAG 3660 

GCAAACAATG AATCAACAAC TCTCCTGGCG CACCATCGTC GGCTACAGCC TCGGTGGGGA 372 0 

ATTGGAGCTC GAATTTCCCC GATCGTTCAA ACATTTGGCA ATAAAGTTTC TTAAGATTGA 3780 

ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT GTTGAATTAC GTTAAGCATG 3840 

TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG ATTAGAGTCC 39 00 

CGCAATTATA CATTTAATAC GCGATAGAAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 3960 

TATCGCGCGC GGTGTCATCT ATGTT AC TAG ATCGATCGGG AATTAAGCTT AGATCTGCAT 4020 
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GGGTGGAGAC TTTTCAACAA AGGGTAATAT CCGGAAACCT CCTCGGATTC CATTGCCCAG 4080 

CTATCTGTCA CTTTATTGTG AAGATAGTGG AAAAGGAAGG TGGCTCCTAC AAATGCCATC 4140 

ATTGCGATAA AGGAAAGGCC ATCGTTGAAG ATGCCTCTGC CGACAGTGGT CCCAAAGATG 42 00 

GACCCCCACC CACGAGGAGC ATCGTGGAAA AAGAAGACGT TCCAACCACG TCTTCAAAGC 42 60 

AAGTGGATTG ATGTGATCAT CGATGGAGAC TTTTCAACAA AGGGTAATAT CCGGAAACCT 4320 

CCTCGGATTC CATTGCCCAG CTATCTGTCA CTTTATTGTG AAGATAGTGG AAAAGGAAGG 43 80 

TGGCTCCTAC AAATGCCATC ATTGCGATAA AGGAAAGGCC ATCGTTGAAG ATGCCTCTGC 4440 

CGACAGTGGT CCCAAAGATG GACCCCCACC CACGAGGAGC ATCGTGGAAA AAGAAGACGT 4500 

TCCAACCACG TCTTCAAAGC AAGTGGATTG ATGTGATATC TCCACTGACG TAAGGGATGA 4560 

CGCACAATCC CACTATCCTT CGCAAGACCC TTCCTCTATA TAAGGAAGTT CATTTCATTT 462 0 

GGAGAGAACA CGGGGGACTC TAGAGGATCC AGCTGAAGGC TCGACAAGGC AGTCCACGGA 4680 

GGAGCTGATA TTTGGTGGAC AAGC TGTGGA TAGGAGCAAC CCTATCCCTA AT AT AC C AGC 4740 

ACCACCAAGT CAGGGCAATC CCCAGATCAA GTGCAAAGGT CCGCCTTGTT TCTCCTCTGT 4800 

CTCTTGATCT GACTAATCTT GGTTTATGAT TC GTTGAGT A ATTTTGGGGA AAGCTCCTTT 4860 

GCTGCTCCAC ACATGTCCAT TCGAATTTTA CCGTGTTTAG CAAGGGCGAA AAGTTTGCAT 4920 

CTTGATGATT TAGCTTGACT ATGCGATTGC TTTCCTGGAC CCGTGCAGCT GCGCTCGGAT 4980 

CTGGGGCCAT TTGTTCCAGG CACGGGATAA GCATTCAGCC ATGGCAGACG CCAAAAACAT 5040 

AAAGAAAGGC CCGGCGCCAT TCTATCCTCT AGAGGATGGA ACCGCTGGAG AGCAACTGCA 5100 

TAAGGCTATG AAGAGATACG CCCTGGTTCC TGGAACAATT GCTTTTACAG ATGCACATAT 516 0 

CGAGGTGAAC ATCACGTACG CGGAATACTT CGAAATGTCC GTTCGGTTGG CAGAAGCTAT 5220 

GAAACGATAT GGGCTGAATA CAAATCACAG AATCGTCGTA TGCAGTGAAA ACTCTCTTCA 5280 

ATTCTTTATG CCGGTGTTGG GCGCGTTATT TATCGGAGTT GCAGTTGCGC CCGCGAACGA 5340 

CATTTATAAT GAACGTGAAT TGCTCAACAG TATGAACATT TCGCAGCCTA CCGTAGTGTT 5400 

TGTTTCCAAA AAGGGGTTGC AAAAAATTTT GAACGTGCAA AAAAAATTAC CAATAATCCA 5460 

GAAAATTATT ATCATGGATT CTAAAACGGA TTACCAGGGA TTTCAGTCGA TGTACACGTT 5520 

CGTCACATCT CATCTACCTC CCGGTTTTAA TGAATACGAT TTTGT AC C AG AGTCCTTTGA 5580 

TCGTGACAAA ACAATTGCAC TGATAATGAA TTCCTCTGGA TCTACTGGGT TACCTAAGGG 5640 

TGTGGCCCTT CCGCATAGAA CTGCCTGCGT CAGATTCTCG CATGCCAGAG ATCCTATTTT 5700 

TGGCAATCAA ATCATTCCGG ATACTGC GAT TTTAAGTGTT GTTCCATTCC ATCACGGTTT 5760 
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TGGAATGTTT ACTACACTCG GATATTTGAT ATGTGGATTT CGAGTCGTCT TAATGTATAG 582 0 

ATTTGAAGAA GAGCTGTTTT TACGATCCCT TCAGGATTAC AAAATTCAAA GTGCGTTGCT 5880 

AGTACCAACC CTATTTTCAT TCTTCGCCAA AAGCACTCTG ATTGACAAAT ACGATTTATC 5940 

TAATTTACAC GAAATTGCTT CTGGGGGCGC ACCTCTTTCG AAAGAAGTCG GGG AAGCGGT 6000 

TGCAAAACGC TTCCATCTTC CAGGGATACG ACAAGGATAT GGGCTCACTG AGACTACATC 6060 

AGCTATTCTG ATTACACCCG AGGGGGATGA TAAACCGGGC GCGGTCGGTA AAGTTGTTCC 612 0 

ATTTTTTGAA GCGAAGGTTG TGGATC TGGA TACCGGGAAA ACGCTGGGCG TTAATCAGAG 6180 

AGGCGAATTA TGTGTCAGAG GACCTATGAT TATGTCCGGT TATGTAAACA ATCCGGAAGC 6240 

GACCAACGCC TTGATTGACA AGGATGGATG GCTACATTCT GGAGACATAG CTTACTGGGA 6300 

CGAAGACGAA CACTTCTTCA TAGTTGACCG CTTGAAGTCT TTAATTAAAT ACAAAGGATA 63 60 

TCAGGTGGCC CCCGCTGAAT TGGAATC GAT ATTGTTACAA CACCCCAACA TCTTCGACGC 6420 

GGGCGTGGCA GGTCTTCCCG AC GATGAC GC CGGTGAACTT CCCGCCGCCG TTGTTGTTTT 6480 

GGAGCACGGA AAGACGATGA CGGAAAAAGA GATCGTGGAT TACGTCGCCA GTCAAGTAAC 6540 

AACCGCGAAA AAGTTGCGCG GAGGAGTTGT GTTTGTGGAC GAAGTACCGA AAGGTCTTAC 6600 

CGGAAAACTC GACGCAAGAA AAATCAGAGA GATCCTCATA AAGGCCAAGA AGGGCGG AAA 6660 

GTCCAAATTG TAAAATGTAA CTGTATTCAG CGATGACGAA ATTCTTAGCT ATTGTAATCA 6720 

GATCCGCGAA TTTCCCCGAT CGTTCAAACA TTTGGCAATA AAGTTTCTTA AGATTGAATC 6780 

CTGTTGCCGG TCTTGCGATG ATTATCATAT AATTTCTGTT GAATTACGTT AAGCATGTAA 6840 

TAATTAACAT GTAATGCATG ACGTTATTTA TGAGATGGGT TTTTATGATT AGAGTCCCGC 6900 

AATTATACAT TTAATACGCG ATAGAAAACA AAATATAGCG CGCAAACTAG GATAAATTAT 6960 

CGCGCGCGGT GTCATCTATG TTACTAGATC GATCGGGAAT TGAGATCTCA TATGTCGAGC 7020 

TCGGGGATCT CCTTTGCCCC AGAGATCACA ATGGACGACT TCCTCTATCT CTACGATCTA 7080 

GTCAGGAAGT TCGAC GGAGA AGGTGACGAT ACCATGTTCA CCACTGATAA TGAGAAGATT 7140 

AGC CTTTTC A ATTTCAGAAA GAATGCTAAC CCACAGATGG TTAGAGAGGC TTACGCAGCA 7200 

GGTCTCATCA AGACGATCTA CCCGAGCAAT AATCTCCAGG AGATCAAATA CCTTCCCAAG 72 60 

AAGGTTAAAG ATGCAGTCAA AAGATTCAGG ACTAACTGCA TCAAGAACAC AGAGAAAGAT 7320 

ATATTTCTCA AGATCAGAAG TACTATTCCA GTATGGACGA TTCAAGGCTT GCTTCACAAA 7380 

CCAAGGCAAG TAATAGAGAT TGGAGTCTCT AAAAAGGTAG TTCCCACTGA ATCAAAGGCC 7440 
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ATGGAGTCAA 


AGATTCAAAT 


AGAGGACCTA 


ACAGAACTCG 


CCGTAAAGAC 


TGGCGAACAG 


7500 


TTCCATCGAT 


GATTGAGACT 


TTTCAACAAA 


GGGTAATATC 


CGGAAACCTC 


CTCGGATTCC 


7560 


ATTGCCCAGC 


TATCTGTCAC 


TTTATTGTGA 


AGATAGTGGA 


AAAGGAAGGT 


GGCTCCTACA 


7620 


AATGCCATCA 


TTGC GAT AAA 


GGAAAGGCCA 


TCGTTGAAGA 


TGCCTCTGCC 


GACAGTGGTC 


7680 


CCAAAGATGG 


ACCCCCACCC 


AC GAGGAGC A 


TCGTGGAAAA 


AGAAGACGTT 


CCAACCACGT 


7740 


CTTCAAAGCA 


AGTGGATTGA 


TGTGATATCT 


CCACTGACGT 


AAGGGATGAC 


GCACAATCCC 


7800 


ACTATCCTTC 


GCAAGACCCT 


TCCTCTATAT 


AAGGAAGTTC 


ATTTCATTTG 


GAGAGGACAC 


7860 


GCTGACAAGC 


TCGGATCCTT 


TAGCATGATT 


GAACAAGATG 


GATTGCACGC 


AGGTTCTCCG 


7920 


GCCGCTTGGG 


TGGAGAGGCT 


ATTCGGCTAT 


GACTGGGCAC 


AACAGACAAT 


CGGCTGCTCT 


7980 


GATGCCGCCG 


TGTTCCGGCT 


GTCAGCGCAG 


GGGCGCCCGG 


TTCTTTTTGT 


CAAGACCGAC 


8040 


CTGTCCGGTG 


CCCTGAATGA 


ACTGCAGGAC 


GAGGCAGCGC 


GGCTATCGTG 


GCTGGCCACG 


8100 


ACGGGCGTTC 


CTTGCGCAGC 


TGTGCTCGAC 


GTTGTCACTG 


AAGCGGGAAG 


GGACTGGCTG 


8160 


CTATTGGGCG 


AAGTGCCGGG 


GCAGGATCTC 


CTGTCATCTC 


ACCTTGCTCC 


TGCCGAGAAA 


8220 


GT ATC CATC A 


TGGCTGATGC 


AATGC GGCGG 


CTGCATACGC 


TTGATCCGGC 


TACCTGCCCA 


8280 


TTCGACCACC 


AAGCGAAACA 


TC GC ATC GAG 


CGAGCACGTA 


CTCGGATGGA 


AGCCGGTCTT 


8340 


GTCGATCAGG 


ATGATCTGGA 


CGAAGAGCAT 


CAGGGGCTCG 


CGCCAGCCGA 


ACTGTTCGCC 


8400 


AGGCTCAAGG 


CGCGCATGCC 


CGACGGCGAG 


GATCTCGTCG 


TGACCCATGG 


CGATGCCTGC 


8460 


TTGCCGAATA 


TCATGGTGGA 


AAATGGCCGC 


TTTTCTGGAT 


TCATCGACTG 


TGGCCGGCTG 


8520 


GGTGTGGCGG 


ACCGCTATCA 


GGACATAGCG 


TTGGCTACCC 


GTGATATTGC 


TGAAGAGCTT 


8580 


GGCGGCGAAT 


GGGCTGACCG 


CTTCCTCGTG 


CTTT AC GGT A 


TCGCCGCTCC 


CGATTCGCAG 


8640 


CGCATCGCCT 


TCTATCGCCT 


TCTTGAC GAG 


TTCTTCTGAG 


CGGGACTCTG 


GGGTTCGAAA 


8700 


TGACCGACCA 


AGCGACGCCC 


AACCTGCCAT 


CACGAGATTT 


CGATTCCACC 


GCCGCCTTCT 


8760 


ATGAAAGGTT 


GGGCTTCGGA 


ATCGTTTTCC 


GGGACGCCGG 


CTGGATGATC 


CTCCAGCGCG 


8820 


GGGATCTCAT 


GCTGGAGTTC 


TTCGCCCACC 


CCAACAGAGG 


TGGATGGACA 


GACCCGTTCT 


8880 


TACACCGGAC 


TGGGCGCGGG 


ATAGGATATT 


CAGATTGGGA 


TGGGATTGAG 


CTTAAAGCCG 


8940 


GCGCTGAGAC 


CATGCTCAAG 


GTAGGCAATG 


TCCTCAGCGT 


CGAGCCCGGC 


ATCTATGTCG 


9000 


AGGGCATTGG 


TGGAGCGCGC 


TTCGGGGATA 


CCGTGCTTGT 


AACTGAGACC 


GGATATGAGG 


9060 


CCCTCACTCC 


GCTTGATCTT 


GGCAAAGATA 


TTTGACGCAT 


TTATTAGTAT 


GTGTTAATTT 


9120 


TCATTTGCAG TGCAGTATTT 


TCTATTC GAT 


CTTTATGTAA 


TTCGTTACAA 


TTAATAAATA 
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TTCAAATCAG ATTATTGACT GTCATTTGTA TCAAATCGTG TTTAATGGAT ATTTTTATTA 9240 

TAATATTGAT GATATCTCAA TCAAAACGTA GATAATAATA ATATTTATTT AATATTTTTG 93 00 

CGTCGCACAG TGAAAATCTA TATGAGATTA CAAAATACCG ACAACATTAT TTAAGATACA 9360 

TAGACATTAA CCCTGAGACT GTTGGACATC AACGGGTAGA TTCCTTCATG CATAGCACCT 9420 

CATTCTTGGG GACAAAAGCA CGGTTTGGCC GTTCCATTGC TGCACGAACG AGCTTTGCTA 9480 

TATCCTCGGG TTGGATCATC TCATCAGGTC CAATCAAATT TGTCCAAGAA CTCATGTTAG 9540 

TCGCAACGAA ACCGGGGCAT ATGGTGCACT CTCAGTACAA TCTGCTCTGA TGCCGCATAG 9600 

TTAAGCCAGC CCCGACACCC GCCAACACCC GCTGACGCGC CCTGACGGGC TTGTCTGCTC 9 660 

CCGGCATCCG CTTACAGACA AGCTGTGACC GTCTCCGGGA GCTGCATGTG TCAGAGGTTT 9720 

TCACCGTCAT CACCGAAACG CGCGAGACGA AAGGGCCTCG TGATACGCCT ATTTTTATAG 9780 

GTTAATGTCA TGATAATAAT GGTTTCTTAG ACGTCAGGTG GCACTTTTCG GGGAAATGTG 9840 

CGCGGAACCC CTATTTGTTT ATTTTTCTAA ATACATTCAA ATATGTATCC GCTCATGAGA 9900 

CAATAACCCT G AT AAATGC T TCAATAATAT TGAAAAAGGA AGAGTATGAG TATTCAACAT 9960 

TTCCGTGTCG CCCTTATTCC CTTTTTTGCG GCATTTTGCC TTCCTGTTTT TGCTCACCCA 10020 

GAAACGCTGG TGAAAGTAAA AGATGCTGAA GATCAGTTGG GTGCACGAGT GGGTTACATC 10080 

GAACTGGATC TCAACAGCGG TAAGATCCTT GAGAGTTTTC GCCCCGAAGA ACGTTTTCCA 10140 

ATGATGAGCA CTTTTAAAGT TCTGCTATGT GGCGCGGTAT TATCCCGTAT TGACGCCGGG 10200 

CAAGAGCAAC TCGGTCGCCG CATACACTAT TCTCAGAATG ACTTGGTTGA GTACTCACCA 10260 

GTCACAGAAA AGCATCTTAC GGATGGCATG ACAGTAAGAG AATTATGCAG TGCTGCCATA 10320 

ACCATGAGTG ATAACACTGC GGCCAACTTA CTTCTGACAA CGATCGGAGG ACCGAAGGAG 10380 

CTAACCGCTT TTTTGCACAA CATGGGGGAT CATGTAACTC GCCTTGATCG TTGGGAACCG 10440 

GAGCTGAATG AAGCCATACC AAACGACGAG CGTGACACCA CGATGCCTGT AGCAATGGCA 10500 

ACAACGTTGC GCAAACTATT AACTGGC GAA CTACTTACTC TAGCTTCCCG GCAACAATTA 10560 

ATAGACTGGA TGGAGGCGGA TAAAGTTGCA GGACCACTTC TGCGCTCGGC CCTTCCGGCT 10620 

GGCTGGTTTA TTGCTGATAA ATCTGGAGCC GGTGAGCGTG GGTCTCGCGG TATCATTGCA 10680 

GCACTGGGGC CAGATGGTAA GCCCTCCCGT ATCGTAGTTA TCTACACGAC GGGGAGTCAG 10740 

GC AACTATGG ATGAACGAAA TAGACAGATC GCTGAGATAG GTGCCTCACT GATTAAGCAT 10800 

TGGTAACTGT CAGACCAAGT TTACTCATAT ATACTTTAGA TTGATTTAAA ACTTCATTTT 10860 
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TAATTTAAAA GGATCTAGGT GAAGATCCTT TTTGATAATC TCATGACCAA AATCCCTTAA 10920 

CGTGAGTTTT CGTTCCACTG AGCGTCAGAC CCCGTAGAAA AGATCAAAGG ATCTTCTTGA 10980 

GATCCTTTTT TTCTGCGCGT AATCTGCTGC TTGCAAACAA AAAAACCACC GCTACCAGCG 11040 

GTGGTTTGTT TGCCGGATCA AGAGCTACCA ACTCTTTTTC CGAAGGTAAC TGGCTTCAGC 11100 

AGAGCGCAGA TACCAAATAC TGTCCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG 11160 

AACTCTGTAG CACCGCCTAC ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC 11220 

AGTGGCGATA AGTCGTGTCT TACCGGGTTG GACTCAAGAC GATAGTTACC GGATAAGGCG 11280 

CAGCGGTCGG GCTGAACGGG GGGTTCGTGC ACACAGCCCA GCTTGGAGCG AACGACC TAG 11340 

ACC GAACTGA GATACCTACA GCGTGAGCAT TGAGAAAGCG CCACGCTTCC CGAAGGGAGA 11400 

AAGGC GGAC A GGTATCCGGT AAGCGGCAGG GTC GGAAC AG GAGAGCGCAC GAGGGAGCTT 11460 

CCAGGGGGAA ACGCCTGGTA TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG 11520 

CGTCGATTTT TGTGATGCTC GTCAGGGGGG CGGAGC CT AT GGAAAAACGC CAGCAACGCG 11580 

GCCTTTTTAC GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC ACATGTTCTT TCCTGCGTTA 11640 

TCCCCTGATT CTGTGGATAA C C GT ATT AC C GC C TTTGAGT GAGCTGATAC CGCTCGCCGC 11700 

AGC CGAACGA CCGAGCGCAG CGAGTCAGTG AGCGAGGAAG CGGAAGAGCG CCCAATACGC 11760 

AAACCGCCTC TCCCCGCGCG TTGGCCGATT CATTAATGCA GCTGGCACGA CAGGTTTCCC 11820 

GACTGGAAAG CGGGCAGTGA GCGCAACGCA ATTAATGTGA GTTAGCTCAC TCATTAGGCA 11880 

CCCCAGGCTT TACACTTTAT GCTTCCGGCT CGTATGTTGT GTGGAATTGT GAGCGGATAA 11940 

CAATTTCACA CAGGAAACAG CTATGACCAT GATTACGCCA AGCTTCCGCG G 11991 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ACGTAC GTAC GGGCCCACCA CTGTTGTAAC TTGTAAGCC * 39 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 
AGGCGGACCT TTGCACTGTG AGTTACCTTC GC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
CTCTGTCGAC GAGCGCAGCT GCACGGGTC 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GCGAAGGTAA CTCACAGTGC AAAGGTCCGC CT 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9299 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GAGCTCCACC GCGGTGGCGG CCGCTCTAGA ACTAGTGGAT CCGTCGACCA TGGCCAGTTG 
CCGGTGGAGC AGGTAAAAAC ACCGTAGCGT AGCAGCCAGG CGGAAGCAGA CGCACAGCAC 
AGGTTGGTTA TGATAGTCAG CCGGGCCACA TGTGTGTAGT TGGTACACTG ATACGCTTAC 
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ACTGTCTCTC CTTTCTTTTT TATTTGTCAC CTTTGGTC G A GCTTACATAA TTGTGTGACT 240 

AAAAAAAGGT CACTTCATTC AGAAATTTAG GGTTGTGGGA ATTTTGGATT TTATTGTGTC 300 

TGTATAGAGT AGCTATAGCT AGCTAGCTAG ATGTGATGTT AATAATTATG AC GATGAG AT 360 

TGGCCCGCTT GGCCGCTTGC ATTGTCTCCC TAGCTCAATA ATGTTTTGAG TTTGTCTTGC 42 0 

CTTTCTTTCA GCTCTAACAA ATTGGAGTAG GGATGACTGA GATACATATA TAAAAGCGAA 480 

AACCGCTGCT CTCTGTTAAT TATTGCACAT CACACATAGG CCAAGCCTTA AGGACAATCA 540 

ACTAAGGATG GTAATAACTA AGGCTAGTGA GGTCGAACTA GGGATGTTAA TATACTCTAG 600 

ATTTTAGACT ATAAAATTTA AGGATCGAAT CAGATTAGTA TCGAACTATA TTTATATTCA 660 

TTTCTAAACT AAATTAATTA AGCACCCTAA ATTATTGTGA TGAAGAGACA TTTCGATCGT 72 0 

GATCCATTAT TACTCCTTGG TCAAACTAAT CTCGTTTTAT GTCACTATTT CATCATCTTT 7 80 

TTTGCGAACG GGTTTATAGC CCGTGTTCCA TTATGAGGAC ATGAACGGTT TAAACAAAGT 840 

TACATATCAT CCCAGCTAGC TACCTAGATT GGAAGCATGG GTTCGGTATA TATATATAGT 900 

TTATATATTT GGTATATATA TATATATATA TATATATATA TATATATCAC ACGTCAGCTT 9 60 

ATATTACGTA AAGTGGGGTT AGTTTTCAAG AAGCGTGGGA CCAGTCACCT CTGCAGTCTG 102 0 

ACCTTGGCTT CAGCTTCGAC AGCAAACAGT CATCTCTTGG AAGCTAAGGA CAGTCTCCAA 1080 

CAGTCAACAA AGCAGCGGTC TGCTTGTAGT TCTCCCTTGC ACGACCAGCT ATATCTAGCA 1140 

TCATAACAAC GGTAAGATCA TCTCTAGCAC GACAAACTTA GTTTAATTAA TTATGTCTAA 12 00 

TCCGTTGTTG TTAGCTTAAA CTTTCTAGCC TCCTATGCTA AGAGAGTTCT CTAGTTCTAC 1260 

TCAGGTGGAT TGATATATAA ATTGGGAATC TTCTAGGCGT CACAAGGTAT GGTACACATC 132 0 

AATCAATGAA CGGACAAAGC AACGGTAAGA TCCGACCCAG TAAAAGTAAT AGCGTTAGGG 1380 

CATGTACAAC CTAGACACTG ATGCACAGTA CTCCAAGTAT AAGACACAAC TAAAACACAA 1440 

CATAATAATA CAGTGGTTAT ATCTAAAACA TGTGTCTTAC CATATTCATT GTAC C AATTA 1500 

GAACATTTAA TAAATTAAAG TGACCAATCA GCTAGCCTCC TGTCTCGAAC ATAGAGCTAA 1560 

GACATTGTGT CTTCGTCAAG ATACATGTCT TAAGTTTTTT TATATTCACT CCCAAAGACA 1620 

CACTCTAAGA CACAACGTAA CACACCCATT GTACATGCTC TTAACCTAAG TTATCATGGA 1680 

TGACCACGCG TGGCAATTAA AAAAATAATT TTTGCCTCCT AAAACCTCTT TCTTAATTGG 1740 

TTCTTGCTTG CAAATCACCA GCGAACCCAT ATGAAAGGAT GCTCAAAATC TGGCCACCGC 1800 

ATCAGGGTTG GTGAATGCAA VGTAAAAAAT AATGCATAAA TCAGCTCTCT GATCAGTTAT 1860 

ATAATCGTGC CTTTT AATTA TTCATGCCAG CTTTATCTGA CTCACGAAAT CATTGATAAA 1920 
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TTATTCCTCA GCTGTATTAG AAAGAGCAGT GTTGTTTAAC TTGGAAAGTG ATGTGGAAGC 1980 

GTGTGATTGC GGTTGAGCTT GTATAGGAGT AAAATGAGGA ACAGTAGGAA AATAATTTTT 2 040 

TCGGATTAAA ACCGGTTGTT TGGACTGCGG CAGATACAAT TCATAGAGAT AAAAACACCG 2100 

TAGAAGTATT AGAAGC CGAT AAAGATTAAA CCCAAATGAA CGAACAGGCT AAACAAATCC 2160 

GGCGCCTCAA AAGTCAAGAG CAGGTACTGG GCTGTCTTGC ACACGTCGCT TTTTGTCTCC 222 0 

CCCTGGCCCC TGGGTGAGAG TAGTAGGGAT GCTAAAGTTT GCTTTCTCTT TTTGAGGCAT 2280 

GTGATAGGCT CTTGTTAGTT GCTAGGGCTA TGTTTATAAT ATTTGCGCTT TTACCTATGT 2340 

AC GTAAGAAC CGGATGGAAT AATGCTATGC AGGAACCAAT TATGTTTGGT CGAAATATAT 2400 

AGTGACCTAT CATAATGTTA TCCCTGTTCA TGTACCTAGG TGGCTAATGA TAT AC GGC AT 2460 

ATGAATACAG TAATCATCCA AGC AC GTAAA AACTCGCTAG ACGTTTATGC CTGCTAGCCT 252 0 

GCTGGGTGTG TAGACTGGAG TACTGGACAA ACATCGCAAT ACAGAGGTAC AGTATTTGTC 258 0 

TAGACAATGA TATACATAGA TAAAAAC C AC TGTTGTAACT TGTAAGCCAC TAGCTCACGT 2 640 

TCTCCATGAG CTCTTCTCTC TGCTGTTTCT TCCTCTGCTA ACTGCGTTAT GATATGACGT 27 00 

CGTATAAATA ATCTCACAAT ACTTCCTTAT TTTCAGCATG GCCTCTTTTA TGTTTATTTA 27 60 

ACAGTAGCAA CCAACGCCGC TCGATGTTTC CTTCAAGAAA CGGCCACTCA CTATGTGGTG 282 0 

TGCAGAAGAA CAAATGTAAG CAGCTCCTAC AGGTACCAGT AGTCATGTCA GTGTGGAAGC 2880 

TTTCCAACCA ACGCCTCCTT CGAGGAACCT GGTCGTGCTG ACATGAATGT AGGCCATGCA 2940 

AGCACAAGCA CCTAACGCGA ATCATCACGA CGCGCCGTGT ACTGGGCGTT GGTACATCAC 3000 

ACCCCGCGTT TGACCTGATC GGAAGCATGC GTGTGTGTTG GCTGCAGGAC C GGCTATAGG 3060 

TTTCCTGCAT TGGACAGCAG AAGCCAGTCA TGTTAGGCAC TCACGCGCTC CTGCCGTTTG 312 0 

ATGAATCATC CGGTCTTTCG TATTGATCAC TAGTTCACTA CGCTGATATA GCAAATTTTA 3180 

AGATGTGAAA CCACGAGACG AGCGATAAAT CTTAGACGTT ACCTATCCAT ATGAAGCTTG 3240 

TGCGAAAAAA AGGCGTGCCG CTGTAGCATC ATTCGTATAC ACTTTTGTCC CCAAAGACAG 3300 

GGATACGAAT CCATGCTCGA CAGAACCCTC CCTTCCCTGC AGATAACGAC ACTTAAGTAT 3360 

AACAAAAGTA GTTGGATTAT TTCAGAAGCA AAATCTCACT TTTCGCTGGC CTTTTTGTAC 342 0 

TTTGGTTACT TGAGTTCAGA CAGTGTATGC TATATTGTCA TGTGCTGCGT AAGGTTTAAA 3480 

TATGGTTCGA CAAATATATC AGTATATCAC TACTTTGTTA TGGGTGGGGC CT AGC AC AAA 3540 

CTTGATACAG CTAGGATAAA GTTAGAACGA TGACTGATCT ACTGTAAAGC GACACCTGTC 3600 
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CTGTTATGGT AGTTTAAGTC CATTCCTGGA CGACTCCAGA TCCAGGATAT GATGCTGTTA 3 660 

CATAATGCGA TTGTTCACAA TAAAATTGCA TGATGTTCTT CTACTCTTTA GGCAGTTTTG 372 0 

TTCAACAGGC AAGTTGCATA ATGCATGTGC ATATATGAGC AGCATAATCA TCAATTAATC 3780 

ATAGGTTCGT CATTTTAGTT TCACTCCTTC ACATTATTCC AGCCCTTGAA GAAAAATGTA 3840 

GCAGTGCTTG CTGTTTAATA AGTGGCAGAG CTGTTTTCAC TCCACCTACG CTTGTCTAGG 3900 

ACCAAAATTT TAATCTGTCA CTTTGAGCTA AAACTGAAGC ACCAAACCGC TACAAAAGAA 3960 

CGTAGGAGCT GAATTGTAAC TTGATGGGAT TACTATAGCA GTTGCTACAG TTCTAGCTAG 4020 

CTACCTTATT C T AT ACGC AT CACCCTAACA ACCCGGCTGA CTGCTGCATC TGACCCCACC 4080 

GTCCCCTGCT CCAAACCAAC TCTCCTTTCC TTGCATGCAC TACACCCACT TCCTGCAGCT 4140 

ATATATACCA CCATATGCCC ATCTTATGAA ACCATCCACA AGAGGAGAAG AAACAATCAA 4200 

CCAGCAACAC TCTTCTCTTA TAACATAGTA CAGCGAAGGT AACTCACATG GCAACTTCCA 42 60 

TGGTCCGTCC TGTAGAAACC CCAACCCGTG AAATCAAAAA ACTCGACGGC CTGTGGGCAT 4320 

TCAGTCTGGA TCGCGAAAAC TGTGGAATTG ATCAGCGTTG GTGGGAAAGC GCGTTACAAG 4380 

AAAGCCGGGC AATTGCTGTG CCAGGCAGTT TTAACGATCA GTTCGCCGAT GCAGATATTC 4440 

GTAATTATGC GGGCAACGTC TGGTATCAGC GCGAAGTCTT TATACCGAAA GGTTGGGCAG 4500 

GCCAGCGTAT CGTGCTGCGT TTCGATGCGG TCACTCATTA CGGCAAAGTG TGGGTCAATA 4560 

ATCAGGAAGT GATGGAGCAT CAGGGCGGCT ATACGC CATT TGAAGC CGAT GTCACGCCGT 462 0 

ATGTTATTGC CGGGAAAAGT GTAC GTATC A CCGTTTGTGT GAACAACGAA CTGAACTGGC 4680 

AGACTATCCC GCCGGGAATG GTGATTACCG ACGAAAACGG CAAGAAAAAG CAGTCTTACT 4740 

TCCATGATTT CTTTAACTAT GCCGGAATCC ATCGCAGCGT AATGCTCTAC ACCACGCCGA 4800 

ACACCTGGGT GGACGATATC ACCGTGGTGA CGCATGTCGC GCAAGACTGT AACCACGCGT 4860 

CTGTTGACTG GCAGGTGGTG GCCAATGGTG ATGTCAGCGT TGAACTGCGT GATGCGGATC 4920 

AACAGGTGGT TGCAACTGGA CAAGGCACTA GCGGGACTTT GCAAGTGGTG AATCCGCACC 4980 

TCTGGCAACC GGGTGAAGGT TATCTCTATG AACTGTGCGT CACAGCCAAA AGCCAGACAG 5040 

AGTGTGATAT CTACCCGCTT CGCGTCGGCA TCCGGTCAGT GGCAGTGAAG GGCGAACAGT 5100 

TCCTGATTAA CCACAAACCG TTCTACTTTA CTGGC TTTGG TCGTCATGAA GATGCGGACT 5160 

TACGTGGCAA AGGATTCGAT AACGTGCTGA TGGTGCACGA CCACGCATTA ATGGACTGGA 5220 

TTGGGGCCAA CTCCTACCGT ACCTCGCATT ACCCTTACGC TGAAGAGATG CTCGACTGGG 5280 

CAGATGAACA TGGCATCGTG GTGATTGATG AAACTGCTGC TGTCGGCTTT AACCTCTCTT 5340 
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TAGGCATTGG TTTCGAAGCG GGCAACAAGC CGAAAGAACT GTACAGCGAA GAGGCAGTCA 5400 

ACGGGGAAAC TCAGCAAGCG CACTTACAGG CGATTAAAGA GCTGATAGCG CGTGACAAAA 5460 

ACCACCCAAG CGTGGTGATG TGGAGTATTG CCAACGAACC GGATACCCGT CCGCAAGTGC 552 0 

ACGGGAATAT TTCGCCACTG GCGGAAGCAA CGCGTAAACT CGACCCGACG CGTCCGATCA 5580 

CCTGCGTCAA TGTAATGTTC TGCGACGCTC ACACCGATAC CATCAGCGAT CTCTTTGATG 5640 

TGCTGTGCCT GAACCGTTAT TACGGATGGT ATGTCCAAAG CGGCGATTTG GAAACGGCAG 57 00 

AGAAGGTACT GGAAAAAGAA CTTCTGGCCT GGCAGGAGAA ACTGCATCAG CCGATTATCA 5760 

TCACCGAATA CGGCGTGGAT ACGTTAGCCG GGCTGCACTC AATGTACACC GACATGTGGA 582 0 

GTGAAGAGTA TCAGTGTGCA TGGC TGGAT A TGTATCACCG CGTCTTTGAT CGCGTCAGCG 588 0 

CCGTCGTCGG TGAACAGGTA TGGAATTTCG CCGATTTTGC GACCTCGCAA GGCATATTGC 5940 

GCGTTGGCGG TAACAAGAAA GGGATCTTCA CTCGCGACCG CAAACCGAAG TCGGCGGCTT 6000 

TTCTGCTGCA AAAACGCTGG ACTGGCATGA ACTTCGGTGA AAAACCGCAG CAGGGAGGCA 6060 

AACAATGAAT CAACAACTCT CCTGGCGCAC CATCGTCGGC TACAGCCTCG GTGGGGAATT 6120 

GGAGC TCGAA TTTCCCCGAT CGTTCAAACA TTTGGCAATA AAGTTTCTTA AGATTGAATC 6180 

CTGTTGCCGG TCTTGCGATG ATTATCATAT AATTTCTGTT GAATTACGTT AAGCATGTAA 6240 

TAATTAACAT GTAATGCATG AC GTT ATTTA TGAGATGGGT TTTTATGATT AGAGTCCCGC 63 00 

AATTATACAT TTAATACGCG ATAGAAAACA AAATATAGCG CGCAAACTAG GATAAATTAT 63 60 

CGCGCGCGGT GTCATCTATG TTACTAGATC GATCGGGAAT TAAGCTTATC GATACCGTCG 6420 

ACCTCGAGGG GGGGCCCGGT ACCCAATTCG CCCTATAGTG AGTCGTATTA CAATTCACTG 6480 

GCCGTCGTTT TACAACGTCG TGACTGGGAA AACCCTGGCG TTACCCAACT TAATCGCCTT 6540 

GCAGCACATC CCCCTTTCGC CAGCTGGCGT AATAGCGAAG AGGCCCGCAC CGATCGCCCT 6600 

TCCCAACAGT TGCGCAGCCT GAATGGCGAA TGGCGCGAAA TTGTAAACGT TAATATTTTG 6660 

TTAAAATTCG CGTTAAATTT TTGTTAAATC AGCTCATTTT TTAACCAATA GGCCGAAATC 672 0 

GGCAAAATCC CTTATAAATC AAAAGAATAG ACCGAGATAG GGTTGAGTGT TGTTCCAGTT 6780 

TGGAACAAGA GTCCACTATT AAAGAACGTG GACTCCAACG TCAAAGGGCG AAAAACCGTC 6840 

TATCAGGGCG ATGGCCCACT ACGTGAACCA TCACCCTAAT CAAGTTTTTT GGGGTCGAGG 6900 

TGCCGTAAAG CACTAAATCG GAACCCTAAA GGGAGCCCCC GATTTAGAGC TTGACGGGGA 6960 

AAGCCGGCGA ACGTGGC GAG AAAGGAAGGG AAGAAAGCGA AAGGAGCGGG CGCTAGGGCG 7020 
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CTGGCAAGTG -T AGC GGTC AC GCTGCGCGTA ACCACCACAC CCGCCGCGCT TAATGCGCCG 7080 

CTACAGGGCG CGTCCCAGGT GGC AC TTTTC GGGGAAATGT GCGCGGAACC CCTATTTGTT 7140 

TATTTTTCTA AATACATTCA AATATGTATC CGCTCATGAG ACAATAACCC TGATAAATGC 72 00 

TTCAATAATA TTGAAAAAGG AAGAGTATGA GTATTCAACA TTTCCGTGTC GCCCTTATTC 72 60 

CCTTTTTTGC GGCATTTTGC CTTCCTGTTT TTGCTCACCC AGAAACGCTG GTGAAAGTAA 732 0 

AAGATGCTGA AGATCAGTTG GGTGCACGAG TGGGTTACAT CGAACTGGAT CTCAACAGCG 7380 

GTAAGATCCT TGAGAGTTTT CGCCCCGAAG AACGTTTTCC AATGATGAGC ACTTTTAAAG 7440 

TTCTGCTATG TGGCGCGGTA TTATCCCGTA TTGACGCCGG GCAAGAGCAA CTCGGTCGCC 7500 

GCATACACTA TTCTCAGAAT GACTTGGTTG AGTACTCACC AGTCACAGAA AAGCATCTTA 7560 

CGGATGGCAT GACAGTAAGA GAATTATGCA GTGCTGCCAT AACCATGAGT GATAACACTG 7620 

CGGCCAACTT ACTTCTGACA ACGATCGGAG GACCGAAGGA GCTAACCGCT TTTTTGCACA 7680 

ACATGGGGGA TCATGTAACT CGCCTTGATC GTTGGGAACC GGAGCTGAAT GAAGC CAT AC 7740 

CAAACGACGA GCGTGACACC ACGATGC CTG TAGCAATGGC AACAACGTTG CGCAAACTAT 7800 

TAACTGGCGA ACTACTTACT CTAGCTTCCC GGCAACAATT AATAGACTGG ATGGAGGCGG 7860 

ATAAAGTTGC AGGACCACTT CTGCGCTCGG CCCTTCCGGC TGGCTGGTTT ATTGCTGATA 7920 

AATCTGGAGC CGGTGAGCGT GGGTCTCGCG GTATCATTGC AGCACTGGGG C C AG ATGGTA 7980 

AGCCCTCCCG TATCGTAGTT ATCTAC AC GA CGGGGAGTCA GGCAACTATG GATGAACGAA 8040 

ATAGACAGAT CGCTGAGATA GGTGCCTCAC TGATTAAGCA TTGGTAACTG TCAGACCAAG 8100 

TTTACTCATA TATACTTTAG ATTGATTTAA AACTTCATTT TTAATTTAAA AGGATCTAGG 8160 

TGAAGATCCT TTTTGATAAT CTCATGACCA AAATCCCTTA ACGTGAGTTT TCGTTCCACT 8220 

GAGCGTCAGA CCCCGTAGAA AAGATCAAAG GATCTTCTTG AGATCCTTTT TTTCTGCGCG 8280 

TAATCTGCTG CTTGCAAACA AAAAAACCAC CGCTACCAGC GGTGGTTTGT TTGCC GGATC 8340 

AAGAGCTACC AACTCTTTTT CCGAAGGTAA CTGGCTTCAG CAGAGCGCAG ATACCAAATA 8400 

CTGTCCTTCT AGTGTAGCCG TAGTTAGGCC ACCACTTCAA GAACTCTGTA GCACCGCCTA 8460 

CATACCTCGC TCTGCTAATC CTGTTACCAG TGGCTGCTGC CAGTGGCGAT AAGTCGTGTC 8520 

TTACCGGGTT GGACTCAAGA C G AT AGTT AC CGGATAAGGC GCAGCGGTCG GGCTGAACGG 8580 

GGGGTTCGTG CACACAGCCC AGCTTGGAGC GAACGACCTA CACCGAACTG AGATACCTAC 8640 

AGCGTGAGCT ATGAGAAAGC GCCACGCTTC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG 8700 

TAAGCGGCAG GGTCGGAACA GG AG AGC GC A CGAGGGAGCT TCCAGGGGGA AACGC CTGGT 8760 
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ATCTTTATAG TCCTGTCGGG TTTCGCCACC TCTGACTTGA GCGTCGATTT TTGTGATGCT 882 0 

CGTCAGGGGG GCGGAGCCTA TGGAAAAACG CCAGCAACGC GGCCTTTTTA CGGTTCCTGG 8880 

CCTTTTGCTG GCCTTTTGCT CACATGTTCT TTCCTGCGTT ATCCCCTGAT TCTGTGGATA 8940 

ACC GTATTAC CGCCTTTGAG TGAGCTGATA CCGCTCGCCG CAGCCGAACG ACCGAGCGCA 9000 

GCGAGTCAGT GAGCGAGGAA GCGGAAGAGC GCCCAATACG CAAACCGCCT CTCCCCGCGC 9060 

GTTGGC CGAT TCATTAATGC AGCTGGCACG ACAGGTTTCC CGACTGGAAA GCGGGCAGTG 9120 

AGCGCAACGC AATTAATGTG AGTTAGC TC A CTCATTAGGC ACCCCAGGCT TTACACTTTA 918 0 

TGCTTCCGGC TCGTATGTTG TGTGGAATTG TGAGC GGAT A ACAATTTCAC ACAGGAAACA 9240 

GCTATGAC C A TGATTACGCC AAGCTCGGAA TTAACCCTCA CTAAAGGGAA CAAAAGCTG 9299 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9408 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GAGCTCCACC GCGGTGGCGG CCGCTCTAGA ACTAGTGGAT CCTCTAGAGT CGACCATGGC 60 

CAGTTGCCGG TGGAGCAGGT AAAAACACCG TAGCGTAGCA GCCAGGCGGA AGCAGACGCA 120 

CAGCACAGGT TGGTTATGAT AGTCAGCCGG GCCACATGTG TGTAGTTGGT ACACTGATAC 180 

GCTTACACTG TCTCTCCTTT CTTTTTTATT TGTCACCTTT GGTCGAGCTT ACATAATTGT 240 

GTGACTAAAA AAAGGTCACT TCATTCAGAA ATTTAGGGTT GTGGGAATTT TGGATTTTAT 300 

TGTGTCTGTA TAGAGTAGCT ATAGCTAGCT AGCTAGATGT GATGTTAATA ATTATGACGA 360 

TGAGATTGGC CCGCTTGGCC GCTTGCATTG TCTCCCTAGC TCAATAATGT TTTGAGTTTG 420 

TCTTGCCTTT CTTTCAGCTC TAACAAATTG GAGTAGGGAT GACTGAGATA CATATATAAA 480 

AGCGAAAACC GCTGCTCTCT GTTAATTATT GCACATCACA CATAGGCCAA GCCTTAAGGA 540 

CAATCAACTA AGGATGGTAA TAACTAAGGC TAGTGAGGTC GAACTAGGGA TGTTAATATA 600 

CTCTAGATTT TAGACTATAA AATTTAAGGA TCGAATCAGA TTAGTATCGA ACTATATTTA 660 

TATTCATTTC TAAACTAAAT TAATTAAGCA CCCTAAATTA TTGTGATGAA GAGACATTTC 720 

GATCGTGATC CATTATTACT CCTTGGTCAA ACTAATCTCG TTTTATGTCA CTATTTCATC 780 
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ATC TTTTTTG CGAACGGGTT TATAGCCCGT GTTCCATTAT GAGGACATGA ACGGTTTAAA 840 

C AAAGTT AC A T ATC ATC CCA GCTAGCTACC TAGATTGGAA GCATGGGTTC GGTATATATA 900 

TATAGTTTAT ATATTTGGTA TATATATATA TATATATATA TATATATATA TATCACACGT 9 60 

CAGCTTATAT TACGTAAAGT GGGGTTAGTT TTCAAGAAGC GTGGGAC C AG TCACCTCTGC 102 0 

AGTCTGACCT TGGCTTCAGC TTCGACAGCA AACAGTCATC TCTTGGAAGC TAAGGACAGT 1080 

CTCCAACAGT CAACAAAGCA GCGGTCTGCT TGTAGTTCTC CCTTGCACGA CCAGCTATAT 1140 

CTAGCATCAT AACAACGGTA AGATCATCTC TAGCACGACA AACTTAGTTT AATTAATTAT 12 00 

GTCTAATCCG TTGTTGTTAG CTTAAACTTT CTAGCCTCCT ATGCTAAGAG AGTTCTCTAG 12 60 

TTCTACTCAG GTGGATTGAT ATATAAATTG GGAATCTTCT AGGCGTCACA AGGTATGGTA 132 0 

CACATCAATC AATGAACGGA CAAAGCAACG GTAAGATCCG ACCCAGTAAA AGTAATAGCG 13 80 

TTAGGGCATG TACAACCTAG ACACTGATGC ACAGTACTCC AAGTATAAGA CACAACTAAA 1440 

ACACAACATA ATAATACAGT GGTTATATCT AAAACATGTG TCTTACCATA TTCATTGTAC 1500 

CAATTAGAAC ATTTAATAAA TTAAAGTGAC CAATCAGCTA GCCTCCTGTC TC G AAC AT AG 1560 

AGC TAAGAC A TTGTGTCTTC GTCAAGATAC ATGTCTTAAG TTTTTTTATA TTCACTCCCA 1620 

AAGACACACT CTAAGACACA ACGTAACACA CCCATTGTAC ATGCTCTTAA CCTAAGTTAT 1680 

CATGGATGAC CACGCGTGGC AATTAAAAAA ATAATTTTTG CCTCCTAAAA CCTCTTTCTT 1740 

AATTGGTTCT TGCTTGCAAA TCACCAGCGA ACCCATATGA AAGGATGCTC AAAATCTGGC 1800 

CACCGCATCA GGGTTGGTGA ATGCAAVGTA AAAAATAATG CATAAATCAG CTCTCTGATC 1860 

AGTTATATAA TCGTGCCTTT TAATTATTCA TGCCAGCTTT ATC TGACTC A CGAAATCATT 1920 

GATAAATTAT TCCTCAGCTG TATTAGAAAG AGCAGTGTTG TTTAACTTGG AAAGTGATGT 1980 

GGAAGC GTGT GATTGCGGTT GAGCTTGTAT AGGAGTAAAA TGAGGAACAG TAGGAAAATA 2040 

ATTTTTTCGG ATTAAAACCG GTTGTTTGGA CTGCGGCAGA TACAATTCAT AGAGATAAAA 2100 

ACACCGTAGA AGTATTAGAA GCCGATAAAG ATTAAACCCA AATGAACGAA CAGGCTAAAC 2160 

AAATCCGGCG CCTCAAAAGT CAAGAGCAGG TACTGGGCTG TCTTGCACAC GTCGCTTTTT 2220 

GTCTCCCCCT GGCCCCTGGG TGAGAGTAGT AGGGATGCTA AAGTTTGCTT TCTCTTTTTG 22 80 

AGGCATGTGA TAGGCTCTTG TTAGTTGCTA GGGCTATGTT TATAATATTT GCGCTTTTAC 2340 

CTATGTACGT AAGAACCGGA TGGAATAATG CTATGCAGGA ACCAATTATG TTTGGTC GAA 2400 

ATATATAGTG ACCTATCATA ATGTTATCCC TGTTCATGTA CCTAGGTGGC TAATGATATA 2460 

CGGCATATGA ATACAGTAAT CATCCAAGCA CGTAAAAACT CGCTAGACGT TTATGCCTGC 2520 
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TAGCCTGCTG GGTGTGTAGA CTGGAGTACT GGACAAACAT CGCAATACAG AGGTACAGTA 2580 

TTTGTCTAGA CAATGATATA CATAGATAAA AACCACTGTT GTAACTTGTA AGCCACTAGC 2 640 

TCACGTTCTC CATGAGCTCT TCTCTCTGCT GTTTCTTCCT CTGCTAACTG CGTTATGATA 2700 

TGAC GTCGT A TAAATAATCT CACAATACTT CCTTATTTTC AGCATGGCCT CTTTTATGTT 27 60 

TATTTAACAG TAGCAACCAA CGCCGCTCGA TGTTTCCTTC AAGAAACGGC CACTCACTAT 282 0 

GTGGTGTGCA GAAGAACAAA TGTAAGCAGC TCCTACAGGT ACCAGTAGTC ATGTCAGTGT 2880 

GGAAGC TTTC CAACCAACGC CTCCTTCGAG GAACCTGGTC GTGCTGACAT GAATGTAGGC 2940 

CATGCAAGCA CAAGCACCTA ACGCGAATCA TCACGACGCG CCGTGTACTG GGCGTTGGTA 3000 

CATCACACCC CGCGTTTGAC CTGATCGGAA GCATGCGTGT GTGTTGGCTG CAGGACCGGC 3060 

TATAGGTTTC CTGCATTGGA CAGCAGAAGC CAGTCATGTT AGGCACTCAC GCGCTCCTGC 3120 

CGTTTGATGA ATCATCCGGT CTTTCGTATT GATCACTAGT TCACTACGCT GATATAGCAA 3180 

ATTTTAAGAT GTGAAACCAC GAGACGAGCG ATAAATCTTA GACGTTACCT ATCCATATGA 3240 

AGCTTGTGCG AAAAAAAGGC GTGCCGCTGT AGCATCATTC GTATACACTT TTGTCCCCAA 33 00 

AGACAGGGAT ACGAATC C AT GCTCGACAGA ACCCTCCCTT CCCTGCAGAT AACGACACTT 33 60 

AAGTATAACA AAAGTAGTTG GATTATTTCA GAAGCAAAAT CTCACTTTTC GCTGGCCTTT 3420 

TTGTACTTTG GTTACTTGAG TTCAGACAGT GTATGCTATA TTGTCATGTG CTGCGTAAGG 3480 

TTTAAATATG GTTCGACAAA TATATCAGTA TATCACTACT TTGTTATGGG TGGGGCCTAG 3540 

CACAAACTTG ATACAGCTAG GATAAAGTTA GAACGATGAC TGATCTACTG TAAAGCGACA 3600 

CCTGTCCTGT TATGGTAGTT TAAGTCCATT CCTGGACGAC TCCAGATCCA GGATATGATG 3 660 

CTGTTACATA ATGCGATTGT TCACAATAAA ATTGCATGAT GTTCTTCTAC TCTTTAGGCA 3720 

GTTTTGTTCA ACAGGCAAGT TGCATAATGC ATGTGCATAT ATGAGCAGCA TAATCATCAA 3780 

TTAATCATAG GTTCGTCATT TTAGTTTCAC TCCTTCACAT TATTCCAGCC CTTGAAGAAA 3840 

AATGTAGCAG TGCTTGCTGT TTAATAAGTG GCAGAGCTGT TTTCACTCCA CCTACGCTTG 3900 

TCTAGGACCA AAATTTTAAT CTGTCACTTT GAGCTAAAAC TGAAGCACCA AACC GCTAC A 3960 

AAAGAACGTA GGAGCTGAAT TGTAACTTGA TGGGATTACT ATAGCAGTTG CTACAGTTCT 4020 

AGCTAGCTAC CTTATTCTAT ACGCATCACC CTAACAACCC GGCTGACTGC TGCATCTGAC 4080 

CCCACCGTCC CCTGCTCCAA ACCAACTCTC CTTTCCTTGC ATGCACTACA CCCACTTCCT 4140 

GCAGCTATAT ATACCACCAT ATGCCCATCT TATGAAACCA TCCACAAGAG GAGAAGAAAC 4200 
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AATCAACCAG CAACACTCTT CTCTTATAAC ATAGTACAGC GAAGGTAACT CACATGGCAA 42 60 

CTTCCATGGT CCGTCCTGTA GAAACCCCAA CCCGTGAAAT CAAAAAACTC GACGGCCTGT 432 0 

GGGCATTCAG TCTGGATCGC GAAAACTGTG GAATTGATCA GCGTTGGTGG GAAAGCGCGT 4380 

TACAAGAAAG CCGGGCAATT GCTGTGCCAG GCAGTTTTAA CGATCAGTTC GCCGATGCAG 444 0 

ATATTCGTAA TTATGCGGGC AACGTCTGGT ATCAGCGCGA AGTCTTTATA CCGAAAGGTT 4500 

GGGCAGGCCA GCGTATCGTG CTGCGTTTCG ATGCGGTCAC TCATTACGGC AAAGTGTGGG 4560 

TCAATAATCA GGAAGTGATG GAGCATCAGG GCGGCTATAC GCCATTTGAA GCCGATGTCA 4620 

CGCCGTATGT TATTGCCGGG AAAAGTGTAC GTATCACCGT TTGTGTGAAC AACGAAC TGA 4680 

ACTGGCAGAC TATCCCGCCG GGAATGGTGA TTACC GACGA AAACGGCAAG AAAAAGCAGT 4740 

CTTACTTCCA TGATTTCTTT AACTATGCCG GAATCCATCG CAGCGTAATG CTCTACACCA 4800 

CGCCGAACAC CTGGGTGGAC GATATCACCG TGGTGACGCA TGTCGCGCAA GACTGTAACC 4860 

ACGCGTCTGT TGACTGGCAG GTGGTGGCCA ATGGTGATGT CAGCGTTGAA CTGCGTGATG 492 0 

CGGATCAACA GGTGGTTGCA ACTGGACAAG GCACTAGCGG GAC TTTGCAA GTGGTGAATC 4980 

CGCACCTCTG GCAACCGGGT GAAGGTTATC TCTATGAACT GTGCGTCACA GCCAAAAGCC 5040 

AGACAGAGTG TGATATCTAC CCGCTTCGCG TCGGCATCCG GTCAGTGGCA GTGAAGGGCG 5100 

AACAGTTCCT GATTAACCAC AAACCGTTCT ACTTTACTGG CTTTGGTCGT CATGAAGATG 5160 

CGGACTTACG TGGCAAAGGA TTCGATAACG TGCTGATGGT GCACGACCAC GCATTAATGG 522 0 

ACTGGATTGG GGCCAACTCC TACCGTACCT CGCATTACCC TTACGCTGAA GAGATGCTCG 5280 

ACTGGGCAGA TGAACATGGC ATCGTGGTGA TTGATGAAAC TGCTGCTGTC GGCTTTAACC 5340 

TCTCTTTAGG CATTGGTTTC GAAGCGGGCA ACAAGCCGAA AGAACTGTAC AGCGAAGAGG 5400 

CAGTCAACGG GGAAACTCAG CAAGCGCACT TACAGGCGAT TAAAGAGCTG ATAGCGCGTG 5460 

ACAAAAACCA CCCAAGCGTG GTGATGTGGA GTATTGCCAA CGAACCGGAT ACCCGTCCGC 5520 

AAGTGC AC GG GAATATTTCG CCACTGGCGG AAGCAACGCG TAAACTCGAC CCGACGCGTC 5580 

CGATCACCTG CGTCAATGTA ATGTTCTGCG ACGCTCACAC CGATACCATC AGCGATCTCT 5640 

TTGATGTGCT GTGCCTGAAC CGTTATTACG GATGGTATGT CCAAAGCGGC GATTTGGAAA 5700 

CGGCAGAGAA GGTACTGGAA AAAGAACTTC TGGCCTGGCA GGAGAAACTG CATCAGCCGA 5760 

TTATCATCAC CGAATACGGC GTGGATACGT TAGCCGGGCT GCACTCAATG TACACCGACA 5820 

TGTGGAGTGA AGAGTATCAG TGTGCATGGC TGGATATGTA TCACCGCGTC TTTGATCGCG 5880 

TCAGCGCCGT CGTCGGTGAA CAGGTATGGA ATTTCGCCGA TTTTGCGACC TCGCAAGGCA 5940 
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TATTGCGCGT TGGCGGTAAC AAGAAAGGGA TCTTCACTCG CGACCGCAAA CCGAAGTCGG 6000 

CGGCTTTTCT GCTGCAAAAA CGCTGGACTG GCATGAACTT CGGTGAAAAA CCGCAGCAGG 6060 

GAGGCAAACA ATGAATCAAC AACTCTCCTG GCGCACCATC GTCGGCTACA GCCTCGGGAA 612 0 

TTGCTACCGA GCTTCTCGAG GGCACTGAAG TCGCTTGATG TGCTGAATTG TTTGTGATGT 6180 

TGGTGGCGTA TTTTGTTTAA ATAAGTAAGC ATGGC TGTGA TTTTATCATA TGATCGATCT 62 40 

TTGGGGTTTT ATTTAACACA TTGTAAAATG TGTATCTATT AATAACTCAA TGTATAAGAT 63 00 

GTGTTCATTC TTCGGTTGCC AT AG ATC TGC TTATTTGACC TGTGATGTTT TGACTCCAAA 63 6 0 

AACCAAAATC ACAACTCAAT AAACTCATGG AATATGTCCA CCTGTTTCTT GAAGAGTTCA 642 0 

TCTACCATTC CAGTTGGCAT TTATCAGTGT TGCAGCGGCG CTGTGCTTTG TAACATAACA 6480 

ATTGTTC AC G GCATATATCC AAATC TAG AG AAGCTTATCG ATACCGTCGA CCTCGAGGGG 6540 

GGGCCCGGTA CCCAATTCGC CCTATAGTGA GTCGTATTAC AATTCACTGG CCGTCGTTTT 66 00 

ACAACGTCGT GACTGGGAAA ACCCTGGCGT TACCCAACTT AATCGCCTTG CAGCACATCC 66 60 

CCCTTTCGCC AGCTGGC GT A ATAGCGAAGA GGCCCGCACC GATCGCCCTT CCCAACAGTT 672 0 

GCGCAGCCTG AATGGCGAAT GGCGCGAAAT TGTAAACGTT AATATTTTGT TAAAATTCGC 6780 

GTTAAATTTT TGTTAAATCA GCTCATTTTT TAACCAATAG GCCGAAATCG GCAAAATCCC 6840 

TTATAAATCA AAAGAATAGA CCGAGATAGG GTTGAGTGTT GTTCCAGTTT GGAACAAGAG 6900 

TCCACTATTA AAGAACGTGG ACTCCAACGT CAAAGGGCGA AAAACCGTCT ATCAGGGCGA 69 60 

TGGCCCACTA C GTG AAC CAT CACCCTAATC AAGTTTTTTG GGGTCGAGGT GCCGTAAAGC 702 0 

ACTAAATCGG AACC CTAAAG GGAGCCCCCG ATTTAGAGCT TGAC GGGGAA AGCCGGCGAA 7080 

CGTGGCGAGA AAGGAAGGGA AGAAAGCGAA AGGAGCGGGC GCTAGGGCGC TGGCAAGTGT 7140 

AGCGGTCACG CTGCGCGTAA CCACCACACC CGCCGCGCTT AATGCGCCGC TACAGGGCGC 7200 

GTCCCAGGTG GCACTTTTCG GGGAAATGTG CGCGGAACCC CTATTTGTTT ATTTTTCTAA 72 60 

ATACATTCAA ATATGTATCC GCTCATGAGA CAATAACCCT GATAAATGCT TCAATAATAT 7320 

TGAAAAAGGA AGAGTATGAG TATTCAACAT TTCCGTGTCG CCCTTATTCC CTTTTTTGCG 7380 

GCATTTTGCC TTCCTGTTTT TGCTCACCCA GAAACGCTGG TGAAAGTAAA AGATGCTGAA 7440 

GATCAGTTGG GTGCACGAGT GGGTTACATC GAACTGGATC TCAACAGCGG TAAGATCCTT 7500 

GAGAGTTTTC GCCCCGAAGA ACGTTTTCCA ATGATGAGCA CTTTTAAAGT TCTGCTATGT 7560 

GGCGCGGTAT TATCCCGTAT TGACGCCGGG CAAGAGCAAC TCGGTCGCCG CATACACTAT 7620 
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TCTCAGAATG ACTTGGTTGA GTACTCACCA GTCACAGAAA AGCATCTTAC GGATGGCATG 7680 

ACAGTAAGAG AATTATGCAG TGCTGCCATA ACCATGAGTG ATAACACTGC GGCCAACTTA 7740 

CTTCTGACAA CGATCGGAGG ACCGAAGGAG CTAACCGCTT TTTTGCACAA CATGGGGGAT 7 800 

CATGTAACTC GCCTTGATCG TTGGGAACCG GAGCTGAATG AAGCCATACC AAACGACGAG 7860 

CGTGACACCA CGATGCCTGT AGCAATGGCA AC AAC GTTGC GCAAACTATT AACTGGCGAA 7920 

CTACTTACTC TAGCTTCCCG GCAACAATTA AT AG AC TGG A TGGAGGCGGA TAAAGTTGCA 7980 

GGACCACTTC TGCGCTCGGC CCTTCCGGCT GGCTGGTTTA TTGCTGATAA ATCTGGAGCC 8040 

GGTGAGCGTG GGTCTCGCGG TATCATTGCA GCACTGGGGC CAGATGGTAA GCCCTCCCGT 8100 

ATCGTAGTTA TCTACACGAC GGGGAGTCAG GCAACTATGG ATGAACGAAA TAGACAGATC 8160 

GCTGAGATAG GTGCCTCACT GATTAAGCAT TGGTAACTGT CAGACCAAGT TTACTCATAT 8220 

ATACTTTAGA TTGATTTAAA ACTTCATTTT TAATTTAAAA GGATCTAGGT GAAGATCCTT 8280 

TTTGATAATC TCATGACCAA AATCCCTTAA CGTGAGTTTT CGTTCCACTG AGCGTCAGAC 8340 

CCCGTAGAAA AGATCAAAGG ATCTTCTTGA GATCCTTTTT TTCTGCGCGT AATCTGCTGC 8400 

TTGCAAACAA AAAAACC AC C GCTACCAGCG GTGGTTTGTT TGCCGGATCA AGAGCTACCA 8460 

ACTCTTTTTC C GAAGGT AAC TGGCTTCAGC AGAGCGCAGA T AC C AAAT AC TGTCCTTCTA 8520 

GTGTAGCCGT AGTTAGGCCA CCACTTCAAG AACTCTGTAG CACCGCCTAC ATACCTCGCT 8580 

CTGCTAATCC TGTTACCAGT GGCTGCTGCC AGTGGCGATA AGTCGTGTCT TACCGGGTTG 8640 

GACTCAAGAC GATAGTTACC GGATAAGGCG CAGCGGTCGG GCTGAACGGG GGGTTCGTGC 87 00 

ACACAGCCCA GCTTGGAGCG AACGACCTAC ACCGAACTGA GATACCTACA GCGTGAGCTA 87 60 

TGAGAAAGCG CCACGCTTCC CGAAGGGAGA AAGGCGGACA GGTATCC GGT AAGCGGCAGG 8820 

GTCGGAACAG GAGAGCGCAC GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA TCTTTATAGT 8880 

CCTGTCGGGT TTCGCCACCT CTGACTTGAG CGTCGATTTT TGTGATGCTC GTCAGGGGGG 8940 

CGGAGCCTAT GGAAAAACGC CAGCAACGCG GCCTTTTTAC GGTTCCTGGC CTTTTGCTGG 9000 

CCTTTTGCTC ACATGTTCTT TCCTGCGTTA TCCCCTGATT CTGTGGATAA CCGTATTACC 9060 

GCCTTTGAGT GAGCTGATAC CGCTCGCCGC AGCCGAACGA CCGAGCGCAG CGAGTCAGTG 9120 

AGCGAGGAAG CGGAAGAGCG CCCAATACGC AAACCGCCTC TCCCCGCGCG TTGGCCGATT 9180 

CATTAATGCA GCTGGCACGA CAGGTTTCCC GACTGGAAAG CGGGCAGTGA GCGCAACGCA 9240 

ATTAATGTGA GTTAGCTCAC TCATTAGGCA CCCCAGGCTT TACACTTTAT GCTTCCGGCT 93 00 

CGTATGTTGT GTGGAATTGT GAGCGGATAA CAATTTCACA CAGGAAACAG CTATGACCAT 93 60 
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GATTACGCCA AGCTCGGAAT TAACCCTCAC TAAAGGGAAC AAAAGCTG 



9408 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

TTATCTCGAG GGCACTGAAG TCGCTTGATG TGCTGAATT 3 9 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GGGGAAGCTT CTCTAGATTT GGATATATGC CGTGAACAAT TG 42 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9335 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



AGCTTGCATG 


CCTGCAGGCC 


GGCCTTAATT 


AAGCGGCCGC 


CAGTGTGATG 


GATATCTGCA 


60 


GAATTCGGCT 


TGGGGGATCC 


TCTAGACAAT 


GATATACATA 


GATAAAAACC 


ACTGTTGTAA 


120 


CTTGTAAGCC 


ACTAGCTCAC 


GTTCTCCATG 


AGCTCTTCTC 


TCTGCTGTTT 


CTTCCTCTGC 


180 


TAACTGCGTT 


ATGATATGAC 


GTCGTATAAA 


TAATCTCACA 


ATACTTCCTT 


ATTTTCAGCA 


240 


TGGCCTCTTT 


TATGTTTATT 


TAACAGTAGC 


AACCAACGCC 


GCTCGATGTT 


TCCTTCAAGA 


300 


AACGGCCACT 


CACTATGTGG 


TGTGCAGAAG 


AACAAATGTA 


AGCAGCTCCT 


ACAGGTACCA 


360 


GTAGTCATGT 


CAGTGTGGAA 


GCTTTCCAAC 


CAACGCCTCC 


TTCGAGGAAC 


CTGGTCGTGC 


420 
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TGACATGAAT GTAGGCCATG CAAGCACAAG CACCTAACGC GAATCATCAC GACGCGCCGT 480 

GTACTGGGCG TTGGTACATC ACACCCCGCG TTTGACCTGA TCGGAAGCAT GCGTGTGTGT 540 

TGGCTGCAGG ACCGGCTATA GGTTTCCTGC ATTGGACAGC AGAAGCCAGT CATGTTAGGC 600 

ACTCACGCGC TCCTGCCGTT TGATGAATCA TCCGGTCTTT CGTATTGATC ACTAGTTCAC 660 

TACGCTGATA TAGCAAATTT TAAGATGTGA AACCACGAGA CGAGCGATAA ATCTTAGACG 72 0 

TTACCTATCC ATATGAAGCT TGTGC GAAAA AAAGGCGTGC CGCTGTAGCA TCATTCGTAT 780 

ACACTTTTGT CCCCAAAGAC AGGG AT AC GA ATCCATGCTC GACAGAACCC TCCCTTCCCT 840 

GCAGATAACG AC AC TT AAGT ATAACAAAAG TAGTTGGATT ATTTCAGAAG CAAAATCTCA 900 

CTTTTCGCTG GCCTTTTTGT AC TTTGGTT A CTTGAGTTCA GACAGTGTAT GCTATATTGT 9 60 

CATGTGCTGC GTAAGGTTTA AATATGGTTC GACAAATATA TCAGTATATC ACTACTTTGT 102 0 

TATGGGTGGG GCCTAGCACA AACTTGATAC AGC T AGGAT A AAGTTAGAAC GATGACTGAT 1080 

CTACTGTAAA GCGACACCTG- TCCTGTTATG GTAGTTTAAG TCCATTCCTG GACGACTCCA 1140 

GATCCAGGAT ATGATGCTGT TACATAATGC GATTGTTCAC AATAAAATTG CATGATGTTC 1200 

TTCTACTCTT TAGGCAGTTT TGTTCAACAG GCAAGTTGCA TAATGCATGT GCATATATGA 12 6 0 

GCAGCATAAT CATCAATTAA TCATAGGTTC GTCATTTTAG * TTTCACTCCT TCACATTATT 132 0 

CCAGCCCTTG AAGAAAAATG TAGCAGTGCT TGCTGTTTAA TAAGTGGCAG AGCTGTTTTC 1380 

ACTCCACCTA CGCTTGTCTA GGACCAAAAT TTTAATCTGT CACTTTGAGC TAAAACTGAA 1440 

GCACCAAACC GCTACAAAAG AACGTAGGAG CTGAATTGTA ACTTGATGGG ATTACTATAG 1500 

CAGTTGCTAC AGTTCTAGCT AGCTACC TTA TTCTATACGC ATCACCCTAA CAACCCGGCT 1560 

GACTGCTGCA TCTGACCCCA CCGTCCCCTG CTCCAAACCA ACTCTCCTTT CCTTGCATGC 1620 

ACTACACCCA CTTCCTGCAG CTATATATAC CACCATATGC CCATCTTATG AAACCATCCA 1680 

CAAGAGGAGA AGAAACAATC AACCAGCAAC ACTCTTCTCT TATAACATAG TACAGCGAAG 1740 

GAGATCCTGA CTGCTTTGTC AAGGTTCAAT TCTGCTTCCT CTGTTATGTT CTTTATATTA 1800 

CATGCTCTGA CAAAGCTATA AAGCTTGATA CTGCAGTATA ATATAACAAG TTAGCTACAC 1860 

AAGTTTTGTA CTTCAAGTCT TTTAACTATA TGTTGGTGCA ATAAGATTAT GAGTAATCCA 1920 

TATGAAGGTG TTGCAAGAGA ACATGAAAGG CAAAGATAAA CGGATGAACC CATTACTAGC 1980 

TTTGGC TGT A TCAGACCAAT AACTTGAAAT GCACTTGTGC TAGCATGCCT AAGTATTAGA 2040 

AAAGGTAGCA TGGGAGAATC TATATTATTT TGGCTAACTT CTTTAGTTAC TATTGATTGA 2100 

TGAGAAAGCC TACCATTGCC CATGCCAGCC CTAATGTCCC GGTGACATGA TTGAGCCAGT 2160 
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ACTATGATTA ATTTACTCTA TTGTTCTCCT TTTTTGAGTG CTGTATAAGA TGTCCTTTTT 222 0 

TTGAGCCACT CGAGAAGATG TTTACTTAAC TCTAGTGCGC AATGATTGGA GCTCTCAGTG 2280 

CAACGCATGT GCTCTGTAAT CTACTGTCAC CACTACTCTG TAGTGTGTGC TTAAACTCTA 2340 

AACTATTCCA CGTGGCTAGT AATTACCAAT CATTTACAAC ACTGTTACAT GTGTAGGGCT 24 00 

GCGATCCATG GTCCGTCCTG TAGAAACCCC AACCCGTGAA ATCAAAAAAC TCGACGGCCT 2460 

GTGGGCATTC AGTCTGGATC GCGAAAACTG TGGAATTGAT CAGCGTTGGT GGGAAAGCGC 252 0 

GTTACAAGAA AGCCGGGCAA TTGCTGTGCC AGGCAGTTTT AACGATCAGT TCGCCGATGC 2580 

AGATATTCGT AATTATGCGG GCAACGTCTG GTATCAGCGC GAAGTCTTTA TACCGAAAGG 2640 

TTGGGCAGGC CAGCGTATCG TGCTGCGTTT CGATGCGGTC ACTCATTACG GCAAAGTGTG 2700 

GGTCAATAAT CAGGAAGTGA TGGAGCATCA GGGCGGCTAT ACGCCATTTG AAGCCGATGT 2760 

CACGCCGTAT GTTATTGCCG GGAAAAGTGT ACGTATC AC C GTTTGTGTGA ACAACGAACT 282 0 

GAACTGGCAG ACTATCCCGC CGGGAATGGT GATTACCGAC GAAAACGGCA AGAAAAAGCA 2880 

GTCTTACTTC CATGATTTCT TTAACTATGC CGGAATCCAT CGCAGCGTAA TGCTCTACAC 2940 

CACGCCGAAC ACCTGGGTGG ACGATATCAC CGTGGTGACG CATGTCGCGC AAGACTGTAA 3 000 

CCACGCGTCT GTTGACTGGC AGGTGGTGGC CAATGGTGAT GTCAGCGTTG AACTGCGTGA 30 60 

TGCGGATCAA CAGGTGGTTG CAACTGGACA AGGCACTAGC GGGACTTTGC AAGTGGTGAA 312 0 

TCCGCACCTC TGGCAACCGG GTGAAGGTTA TCTCTATGAA CTGTGCGTCA CAGCCAAAAG 3180 

CCAGACAGAG TGTGATATCT ACCCGCTTCG CGTCGGCATC CGGTCAGTGG CAGTGAAGGG 3240 

CGAACAGTTC CTGATTAACC ACAAACCGTT CTACTTTACT GGCTTTGGTC GTCATGAAGA 33 00 

TGCGGACTTA CGTGGCAAAG GATTCGATAA CGTGCTGATG GTGCACGACC ACGCATTAAT 33 60 

GGACTGGATT GGGGCCAACT CCTACCGTAC CTCGCATTAC CCTTACGCTG AAGAGATGCT 3420 

CGACTGGGCA GATGAACATG GCATCGTGGT GATTGATGAA ACTGCTGCTG TCGGCTTTAA 3480 

CCTCTCTTTA GGCATTGGTT TCGAAGC GGG CAACAAGCCG AAAGAACTGT ACAGCGAAGA 3540 

GGCAGTCAAC GGGGAAACTC AGCAAGCGCA CTTACAGGCG ATTAAAGAGC TGATAGCGCG 3 600 

TGACAAAAAC CACCCAAGCG TGGTGATGTG GAGTATTGCC AACGAAC CGG ATACCCGTCC 3660 

GCAAGTGCAC GGG AATATTT CGCCACTGGC GGAAGCAACG CGTAAACTCG ACCCGACGCG 3720 

TCCGATCACC TGCGTCAATG TAATGTTCTG CGACGCTCAC ACCGATACCA TCAGCGATCT 3780 

CTTTGATGTG CTGTGCCTGA ACCGTTATTA CGGATGGTAT GTCCAAAGCG GCGATTTGGA 3840 
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AACGGCAGAG AAGGTACTGG AAAAAGAACT TCTGGCCTGG CAGGAGAAAC TGCATCAGCC 3900 

GATTATCATC ACCGAATACG GCGTGGATAC GTTAGCCGGG CTGCACTCAA TGTAC AC CGA 3960 

CATGTGGAGT GAAGAGTATC AGTGTGCATG GCTGGATATG TATCACCGCG TCTTTGATCG 4020 

CGTCAGCGCC GTCGTCGGTG AACAGGTATG GAATTTCGCC GATTTTGCGA CCTCGCAAGG 4080 

CATATTGCGC GTTGGCGGTA ACAAGAAAGG GATCTTCACT CGCGACCGCA AACCGAAGTC 4140 

GGCGGCTTTT CTGCTGCAAA AACGCTGGAC TGGCATGAAC TTC GGTG AAA AACCGCAGCA 42 00 

GGGAGGCAAA CAATGAATCA ACAACTCTCC TGGCGCACCA TCGTCGGCTA CAGCCTC GGG 42 60 

AATTGCTACC GAGCTTCTCG AGGGCACTGA AGTCGCTTGA TGTGCTGAAT TGTTTGTGAT 4320 

GTTGGTGGCG TATTTTGTTT AAATAAGTAA GCATGGCTGT GATTTTATCA TATGATCGAT 43 80 

CTTTGGGGTT TTATTTAACA CATTGTAAAA TGTGTATCTA TTAATAACTC AATGTATAAG 4440 

ATGTGTTCAT TCTTCGGTTG C C AT AGATCT GCTTATTTGA CCTGTGATGT TTTGACTCCA 4500 

AAAACCAAAA TCACAACTCA ATAAACTCAT GGAATATGTC CACCTGTTTC TTGAAGAGTT 4560 

CATCTACCAT TCCAGTTGGC ATTTATCAGT GTTGCAGCGG CGCTGTGCTT TGTAACATAA 4620 

CAATTGTTCA CGGCATATAT CC AAATC TAG AGAAGCTTAT CGATACCGTC GACCTCGAGG 4680 

GGGGGCCCGG TACCCAATTC GCCCTATAGT GAGTCGTATT *ACAATTCACT GGCCGTCGTT 4740 

TTACAACGTC GTGACTGGGA AAACCCTGGC GTTACCCAAC TTAATCGCCT TGCAGCACAT 4800 

CCCCCTTTCG CCAGAAACGC CCGGGCATTT AAATGGCGCG CCGCGATCGC TTGC AGATCT 4860 

GCATGGGTGG AGACTTTTCA ACAAAGGGTA ATATCC GGAA ACCTCCTCGG ATTCCATTGC 4920 

CCAGCTATCT GTCACTTTAT TGTGAAGATA GTGGAAAAGG AAGGTGGCTC C T AC AAATGC 4980 

CATCATTGCG ATAAAGGAAA GGCCATCGTT GAAGATGCCT CTGCCGACAG TGGTCCCAAA 5040 

GATGGACCCC CACCCACGAG GAGCATCGTG GAAAAAGAAG ACGTTCCAAC CACGTCTTCA 5100 

AAGCAAGTGG ATTGATGTGA TCATCGATGG AGACTTTTCA ACAAAGGGTA ATATCCGGAA 5160 

ACCTCCTCGG ATTCCATTGC CCAGCTATCT GTCACTTTAT TGTGAAGATA GTGGAAAAGG 5220 

AAGGTGGCTC CTACAAATGC CATCATTGCG ATAAAGGAAA GGCCATCGTT GAAGATGCCT 5280 

CTGCCGACAG TGGTCCCAAA GATGGACCCC CACCCACGAG GAGCATCGTG GAAAAAGAAG 5340 

ACGTTCCAAC CACGTCTTCA AAGCAAGTGG ATTGATGTGA TATCTCCACT GACGTAAGGG 5400 

ATGACGCACA ATCCCACTAT CCTTCGCAAG ACCCTTCCTC TATATAAGGA AGTTCATTTC 5460 

ATTTGGAGAG AAC ACGGGGG ACTCTAGAGG ATCCAGCTGA AGGCTCGACA AGGCAGTCCA 5520 

CGGAGGAGCT GATATTTGGT GGACAAGCTG TGGATAGGAG CAACCCTATC CCTAATATAC 5580 
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CAGCACCACC AAGTCAGGGC AATCCCCAGA TCAAGTGCAA AGGTCCGCCT TGTTTCTCCT 5640 

CTGTCTCTTG ATCTGACTAA TCTTGGTTTA TGATTCGTTG AGTAATTTTG GGGAAAGCTC 5700 

CTTTGCTGCT CCACACATGT CCATTCGAAT TTTACCGTGT TTAGCAAGGG CGAAAAGTTT 57 60 

GCATCTTGAT GATTTAGCTT GACTATGC GA TTGCTTTCCT GGACCCGTGC AGCTGCGGAC 582 0 

GGATCTGGGG C C ATTTGTTC CAGGCACGGG ATAAGCATTC AGCCATGGCC CCAGAACGAC 5880 

GCCCGGCCGA CATCCGCCGT GCCACCGAGG CGGACATGCC GGCGGTCTGC ACCATCGTCA 5940 

ACCACTACAT CGAGACAAGC ACGGTCAACT TCCGTACCGA GCCGCAGGAA CCGCAGGAGT 6000 

GGACGGACGA CCTCGTCCGT CTGCGGGAGC GCTATCCCTG GCTCGTCGCC GAGGTGGACG 6060 

GCGAGGTCGC CGGCATCGCC TACGCGGGCC CCTGGAAGGC ACGCAACGCC TACGACTGGA 612 0 

CGGCCGAGTC GACCGTGTAC GTCTCCCCCC GCCACCAGCG GACGGGACTG GGCTCCACGC 6180 

TCTACACCCA CCTGCTGAAG TCCCTGGAGG CACAGGGCTT CAAGAGCGTG GTCGCTGTCA 6240 

TCGGGCTGCC CAACGACCCG AGCGTGCGCA TGCACGAGGC GCTCGGATAT GCCCCCCGCG 63 00 

GCATGCTGCG GGCGGCCGGC TTCAAGCACG GGAACTGGCA TGACGTGGGT TTCTGGCAGC 63 60 

TGGACTTCAG CCTGCCGGTA CCGCCCCGTC CGGTCCTGCC CGTCACCGAA ATCTGATGAG 642 0 

ATCTGAGCTC GAATTTCCCC GATCGTTCAA ACATTTGGCA ATAAAGTTTC TTAAGATTGA 6480 

ATCCTGTTGC CGGTCTTGCG ATGATTATCA TATAATTTCT GTTGAATTAC GTTAAGCATG 6540 

TAATAATTAA CATGTAATGC ATGACGTTAT TTATGAGATG GGTTTTTATG ATTAGAGTCC 6600 

CGCAATTATA CATTTAATAC GC GAT AG AAA ACAAAATATA GCGCGCAAAC TAGGATAAAT 6660 

TATCGCGCGC GGTGTCATCT ATGTTACTAG ATCGATCGGG AATTCACTGG CCGTCGTTTT 6720 

AC AAC GTCGT GAC TGGGAAA ACCCTGGCGT TACCCAACTT AATCGCCTTG CAGCACATCC 6780 

CCCTTTCGCC AGCTGGC GTA AT AGC G AAG A GGCCCGCACC GATCGCCCTT CCCAACAGTT 6840 

GCGCAGCCTG AATGGCGAAT GGCGCCTGAT GCGGTATTTT CTCCTTACGC ATCTGTGCGG 6900 

TATTTCACAC CGCATATGGT GCACTCTCAG TACAATCTGC TCTGATGCCG CATAGTTAAG 6960 

CCAGCCCCGA CACCCGCCAA CACCCGCTGA CGCGCCCTGA CGGGCTTGTC TGCTCCCGGC 7020 

ATCCGCTTAC AGACAAGCTG TGACCGTCTC CGGGAGCTGC ATGTGTCAGA GGTTTTCACC 7080 

GTCATCACCG AAACGCGCGA GACGAAAGGG CCTCGTGATA CGCCTATTTT TATAGGTTAA 7140 

TGTCATGATA ATAATGGTTT CTTAGACGTC AGGTGGCACT TTTCGGGGAA ATGTGCGCGG 7200 

AACCCCTATT TGTTTATTTT TCTAAATACA TTCAAATATG TATCCGCTCA TGAGACAATA 7260 
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ACCCTGATAA ATGCTTCAAT AATATTGAAA AAGGAAGAGT ATGAGTATTC AACATTTCCG 7320 

TGTCGCCCTT ATTCCCTTTT TTGCGGCATT TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC 7380 

GCTGGTGAAA GTAAAAGATG CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT 7440 

GGATCTCAAC AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 7500 

GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTATTGACG CCGGGCAAGA 7560 

GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG GTTGAGTACT CACCAGTCAC 7620 

AGAAAAGCAT CTTACGGATG GCATGACAGT AAGAGAATTA TGCAGTGCTG CCATAACCAT 7680 

GAGTGATAAC ACTGCGGCCA ACTTACTTCT GACAACGATC GGAGGAC CGA AGGAGCTAAC 7740 

CGCTTTTTTG CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT 7800 

GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGTAGCAA TGGCAACAAC 7860 

GTTGCGCAAA CTATTAACTG GC GAACT ACT TACTCTAGCT TCCCGGCAAC AATTAATAGA 7920 

CTGGATGGAG GCGGATAAAG TTGCAGGACC ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG 7980 

GTTTATTGCT GATAAATCTG GAGCC GGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT 8040 

GGGGCCAGAT GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 8100 

TATGGATGAA CGAAATAGAC AGATCGC TGA GAT AGGTGC C TCACTGATTA AGCATTGGTA 8160 

ACTGTCAGAC CAAGTTTACT CATATATACT TTAGATTGAT TTAAAACTTC ATTTTTAATT 822 0 

TAAAAGGATC TAGGTGAAGA TCCTTTTTGA TAATCTCATG ACCAAAATCC CTTAACGTGA 8280 

GTTTTCGTTC CACTGAGCGT CAGACCCCGT AGAAAAGATC AAAGGATCTT CTTGAGATCC 8340 

TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT 8400 

TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG GTAACTGGCT TCAGCAGAGC 8460 

GCAGATACCA AATACTGTCC TTCTAGTGTA GCC GTAGTTA GGCCACCACT TCAAGAACTC 8520 

TGTAGCACCG CCTACATACC TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG 8580 

CGATAAGTCG TGTCTTACCG GGTTGGACTC AAG AC GAT AG TT AC CGGAT A AGGCGCAGCG 8640 

GTCGGGCTGA AC GGGGGGTT CGTGCACACA GCCCAGCTTG GAGC GAACGA CCTACACCGA 87 00 

ACTGAGATAC CTACAGCGTG AGCATTGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC 87 60 

GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG 8820 

GGGAAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG 8880 

ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGCCTT 8940 

TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG TTCTTTCCTG CGTTATCCCC 9000 
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TGATTCTGTG GATAACCGTA TTACCGCCTT TGAGTGAGCT GATACCGCTC GCCGCAGCCG 9060 

AACGACCGAG CGCAGCGAGT CAGTGAGCGA GGAAGCGGAA GAGCGCCCAA TACGCAAACC 912 0 

GCCTCTCCCC GCGCGTTGGC CGATTCATTA ATGCAGCTGG CACGACAGGT TTCCCGACTG 9180 

GAAAGCGGGC AGTGAGCGCA ACGCAATTAA TGTGAGTTAG CTCACTCATT AGGCACCCCA 9240 

GGCTTTACAC TTTATGCTTC CGGCTCGTAT GTTGTGTGGA ATTGTGAGC G GATAACAATT 930 0 

TCACACAGGA AACAGCTATG ACCATGATTA CGCCA 93 3 5 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE SS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GGGGGATCCT CTAGACAATG ATATACATAG ATAAAAACC 3 9 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE SS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

GGGAGATCTC CTTCGCTGTA C T ATGTT AT A AGAGAAGAG 39 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

GGGGGATCCT GACTGCTTTG TCAAGGTTCA ATTCTGCTT 39 
(2) INFORMATION FOR SEQ ID NO: 23: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

GGGCCATGGA TCGCAGCCCT ACACATGTAA CAGTGTTGT 39 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

AAAGAGCTCT GAGGGCACTG AAGTCGCTTG ATGTGC 3 6 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

GGGGAATTCT TGGATATATG CCGTGAACAA TTGTTATGTT AC 42 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

AGCTTGCATG CCTGCAGATC TGCATGGGTG GAGACTTTTC AACAAAGGGT AATATCCGGA 60 
AACCTCCTCG GATTCCATTG CCCAGCTATC TGTCACTTTA TTGTGAAGAT AGTGGAAAAG 120 
GAAGGTGGCT CCTACAAATG CCATCATTGC GATAAAGGAA AGGCCATCGT TGAAGATGCC 180 
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TCTGCCGACA GTGGTCCCAA AGATGG AC CC CCACCCACGA GGAGCATCGT GGAAAAAGAA 240 
GACGTTCCAA CCACGTCTTC AAAGCAAGTG GATTGATGTG ATCATCGATG GAGACTTTTC 300 
AACAAAGGGT AATATCCGGA AACCTCCTCG GATTCCATTG CCCAGCTATC TGTCACTTTA 360 
TTGTGAAGAT AGTGGAAAAG GAAGGTGGCT CCTACAAATG CCATCATTGC GATAAAGGAA 420 
AGGCCATCGT TGAAGATGCC TCTGCCGACA GTGGTCCCAA AGATGGACCC CCACCCACGA 480 
GGAGCATCGT GGAAAAAGAA GACGTTCCAA CCACGTCTTC AAAGCAAGTG GATTGATGTG 540 
ATATCTCCAC TGACGTAAGG GATGACGCAC AATCCCACTA TCCTTCGCAA GACCCTTCCT 600 
CTATATAAGG AAGTTCATTT CATTTGGAGA G AAC AC GGGG GACTCTAGAG GATCCAGCTG 660 
AAGGCTCGAC AAGGCAGTCC ACGGAGGAGC TGATATTTGG TGGACAAGCT GTGGATAGGA 720 
GCAACCCTAT CCCTAATATA CCAGCACCAC CAAGTCAGGG CAATCCCCAG ATCAAGTGCA 780 
AAGGTCCGCC TTGTTTCTCC TCTGTCTCTT GATCTGACTA ATCTTGGTTT ATGATTCGTT 840 
GAGTAATTTT GGGGAAAGCT CCTTTGCTGC TCCACACATG TCCATTCGAA TTTTACCGTG 900 
TTTAGCAAGG GCGAAAAGTT TGCATCTTGA TGATTTAGCT TGACTATGCG ATTGCTTTCC 960 

TGGACCCGTG CAGCTGCGGA CGGATCTGGG GC C ATTTGTT CCAGGCACGG GATAAGCATT 1020 

CAGCCATGGT CCGTCCTGTA GAAACCCCAA CCCGTGAAAT CAAAAAACTC GACGGCCTGT 1080 

GGGCATTCAG TCTGGATCGC GAAAACTGTG GAATTGATCA GCGTTGGTGG GAAAGCGCGT 1140 

TACAAGAAAG CCGGGCAATT GCTGTGCCAG GCAGTTTTAA CGATCAGTTC GCCGATGCAG 12 00 

ATATTCGTAA TTATGCGGGC AACGTCTGGT ATCAGCGCGA AGTCTTTATA CCGAAAGGTT 12 60 

GGGCAGGCCA GCGTATCGTG CTGCGTTTCG ATGCGGTCAC TCATTACGGC AAAGTGTGGG 1320 

TCAATAATCA GGAAGTGATG GAGCATCAGG GCGGCTATAC GCCATTTGAA GCCGATGTCA 1380 

CGCCGTATGT TATTGCCGGG AAAAGTGTAC GTATCACCGT TTGTGTGAAC AACGAACTGA 1440 

ACTGGCAGAC TATCCCGCCG GGAATGGTGA TTACCGACGA AAACGGCAAG AAAAAGCAGT 1500 

CTTACTTCCA TGATTTCTTT AACTATGCCG GAATCCATCG CAGCGTAATG CTCTACACCA 1560 

CGCCGAACAC CTGGGTGGAC GATATCACCG TGGTGACGCA TGTCGCGCAA GACTGTAACC 1620 

ACGCGTCTGT TGACTGGCAG GTGGTGGCCA ATGGTGATGT CAGCGTTGAA CTGCGTGATG 1680 

CGGATCAACA GGTGGTTGCA ACTGGACAAG GCACTAGCGG GACTTTGCAA GTGGTGAATC 1740 

CGCACCTCTG GCAACCGGGT GAAGGTTATC TCTATGAACT GTGCGTCACA GCCAAAAGCC 1800 

AGACAGAGTG TGATATCTAC CCGCTTCGCG TCGGCATCCG GTCAGTGGCA GTGAAGGGCG 1860 
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AACAGTTCCT GATTAACCAC AAACC GTTCT ACTTTACTGG CTTTGGTCGT CATGAAGATG 1920 

CGGACTTACG TGGCAAAGGA TTCGATAACG TGCTGATGGT GCACGACCAC GCATTAATGG 1980 

AC TGGATTGG GGCCAACTCC TACCGTACCT CGCATTACCC TTACGCTGAA GAGATGCTCG 2040 

ACTGGGCAGA TGAACATGGC ATC GTGGTGA TTGATGAAAC TGCTGCTGTC GGCTTTAACC 2100 

TCTCTTTAGG CATTGGTTTC GAAGCGGGCA ACAAGCCGAA AGAACTGTAC AGCGAAGAGG 2160 

CAGTCAACGG GGAAAC TC AG CAAGCGCACT TACAGGCGAT TAAAGAGCTG ATAGCGCGTG 222 0 

ACAAAAACCA CCCAAGCGTG GTGATGTGGA GTATTGCCAA CGAACCGGAT ACCCGTCCGC 2280 

AAGTGCACGG GAATATTTCG CCACTGGCGG AAGCAACGCG TAAACTCGAC CCGACGCGTC 2340 

CGATCACCTG CGTCAATGTA ATGTTCTGCG ACGCTCACAC CGATACCATC AGCGATCTCT 2400 

TTGATGTGCT GTGC CTGAAC CGTTATTACG GATGGTATGT CCAAAGCGGC GATTTGGAAA 2460 

CGGCAGAGAA GGTACTGGAA AAAGAACTTC TGGCCTGGCA GGAGAAACTG CATCAGCCGA 252 0 

TTATCATCAC CGAATACGGC GTGGATACGT TAGCCGGGCT GCACTCAATG TACACCGACA 2580 

TGTGGAGTGA AGAGTATCAG TGTGCATGGC TGGATATGTA TCACCGCGTC TTTGATCGCG 2640 

TCAGCGCCGT CGTCGGTGAA CAGGTATGGA ATTTCGCCGA TTTTGCGACC TCGCAAGGCA 2700 

TATTGCGCGT TGGCGGTAAC AAGAAAGGGA TCTTCACTCG » CGACCGCAAA CCGAAGTCGG 27 60 

CGGCTTTTCT GCTGCAAAAA CGCTGGACTG GCATGAACTT CGGTGAAAAA CCGCAGCAGG 2820 

GAGGCAAACA ATGAATCAAC AACTCTCCTG GCGCACCATC GTCGGCTACA GCCTCGGTGG 2880 

GGAATTGGAG AGC TCTGAGG GC AC TGAAGT CGCTTGATGT GCTGAATTGT TTGTGATGTT 2940 

GGTGGC GT AT TTTGTTTAAA TAAGTAAGCA TGGCTGTGAT TTTATCATAT GATC GATCTT 3000 

TGGGGTTTTA TTTAACACAT TGTAAAATGT GTATCTATTA ATAACTCAAT GTATAAGATG 3060 

TGTTCATTCT TCGGTTGCCA TAGATCTGCT TATTTGACCT GTGATGTTTT GACTCCAAAA 3120 

ACCAAAATCA CAACTCAATA AACTCATGGA ATATGTCCAC CTGTTTCTTG AAGAGTTCAT 3180 

CTACCATTCC AGTTGGCATT TATCAGTGTT GCAGCGGCGC TGTGCTTTGT AACATAACAA 3240 

TTGTTCACGG CATATATCCA AGAATTCACT GGCCGTCGTT TTACAACGTC GTGACTGGGA 33 00 

AAACCCTGGC GTTACCCAAC TTAATCGCCT TGCAGCACAT CCCCCTTTCG CCAGCTGGCG 33 60 

TAATAGCGAA GAGGCCCGCA CCGATCGCCC TTCCCAACAG TTGCGCAGCC TGAATGGCGA 3420 

ATGGCGCCTG ATGCGGTATT TTCTCCTTAC GCATCTGTGC GGTATTTCAC ACCGCATATG 3480 

GTGCACTCTC AGTACAATCT GCTCTGATGC CGCATAGTTA AGCCAGCCCC GACACCCGCC 3540 

AACACCCGCT GACGCGCCCT GACGGGCTTG TCTGCTCCCG GCATCCGCTT ACAGACAAGC 3600 
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TGTGACCGTC TCCGGGAGCT GCATGTGTCA GAGGTTTTCA CCGTCATCAC CGAAACGCGC 3 660 

GAGACGAAAG GGCCTCGTGA TACGCCTATT TTTATAGGTT AATGTCATGA TAATAATGGT 3720 

TTCTTAGACG TCAGGTGGCA CTTTTCGGGG AAATGTGCGC GGAACCCCTA TTTGTTTATT 3780 

TTTCTAAATA CATTCAAATA TGTATCCGCT CATGAGACAA TAACCCTGAT AAATGCTTCA 3840 

ATAATATTGA AAAAGGAAGA GTATGAGTAT TCAACATTTC CGTGTCGCCC TTATTCCCTT 3900 

TTTTGCGGCA TTTTGCCTTC CTGTTTTTGC TCACCCAGAA ACGCTGGTGA AAGTAAAAGA 3960 

TGCTGAAGAT CAGTTGGGTG CACGAGTGGG TTACATCGAA CTGGATCTCA ACAGCGGTAA 402 0 

GATCCTTGAG AGTTTTCGCC CCGAAGAACG TTTTCCAATG ATGAGCACTT TTAAAGTTCT 408 0 

GCTATGTGGC GCGGTATTAT CCCGTATTGA CGCCGGGCAA GAGCAACTCG GTCGCCGCAT 4140 

ACACTATTCT CAGAATGACT TGGTTGAGTA CTCACCAGTC ACAGAAAAGC ATCTTAC GGA 4200 

TGGCATGACA GTAAGAGAAT TATGCAGTGC TGCCATAACC ATGAGTGATA ACACTGCGGC 42 6 0 

CAACTTACTT CTGACAACGA TCGGAGGACC GAAGGAGCTA ACCGCTTTTT TGCACAACAT 432 0 

GGGGGATCAT GTAACTCGCC TTGATCGTTG GGAACC GGAG CTGAATGAAG CCATACCAAA 438 0 

CGACGAGCGT GACACCACGA TGCCTGTAGC AATGGCAACA ACGTTGCGCA AACTATTAAC 4440 

TGGCGAACTA CTTACTCTAG CTTCCCGGCA ACAATTAATA G AC TGGATGG AGGCGGATAA 45 00 

AGTTGCAGGA CCACTTCTGC GCTCGGCCCT TCCGGCTGGC TGGTTTATTG CTGATAAATC 456 0 

TGGAGCCGGT GAGCGTGGGT CTCGCGGTAT CATTGCAGCA CTGGGGCCAG ATGGTAAGCC 462 0 

CTCCCGTATC GTAGTTATCT AC AC GACGGG GAGTCAGGCA ACTATGGATG AACGAAATAG 4680 

ACAGATCGCT GAGATAGGTG CCTCACTGAT TAAGCATTGG TAACTGTCAG ACCAAGTTTA 4740 

CTCATATATA CTTTAGATTG ATTTAAAACT TCATTTTTAA TTTAAAAGGA TCTAGGTGAA 4800 

GATCCTTTTT GATAATCTCA TGACCAAAAT CCCTTAACGT GAGTTTTCGT TCCACTGAGC 4860 

GTCAGACCCC GTAGAAAAGA TCAAAGGATC TTCTTGAGAT CCTTTTTTTC TGCGCGTAAT 492 0 

CTGCTGCTTG CAAACAAAAA AACCACCGCT ACCAGCGGTG GTTTGTTTGC CGGATCAAGA 4980 

GCTACCAACT CTTTTTCCGA AGGTAACTGG CTTCAGCAGA GCGCAGATAC CAAATACTGT 5040 

CCTTCTAGTG TAGCCGTAGT TAGGCCACCA CTTCAAGAAC TCTGTAGCAC CGCCTACATA 5100 

CCTCGCTCTG CTAATCCTGT TACCAGTGGC TGCTGCCAGT GGCGATAAGT CGTGTCTTAC 5160 

CGGGTTGGAC TCAAGACGAT AGTTACCGGA TAAGGC GC AG CGGTCGGGCT GAACGGGGGG 5220 

TTCGTGCACA CAGCCCAGCT TGGAGCGAAC GACCTACACC GAACTGAGAT ACCTACAGCG 5280 
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TGAGCATTGA GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG GCGGACAGGT ATCCGGTAAG 5340 

CGGCAGGGTC GGAACAGGAG AGCGCACGAG GGAGCTTCC A . GGGGGAAACG CCTGGTATCT 5400 

TTATAGTCCT GTCGGGTTTC GCCACCTCTG ACTTGAGCGT CGATTTTTGT GATGCTCGTC 5460 

AGGGGGGCGG AGCCTATGGA AAAACGCCAG CAACGCGGCC TTTTTACGGT TCCTGGCCTT 5520 

TTGCTGGCCT TTTGCTCACA TGTTCTTTCC TGCGTTATCC CCTGATTCTG TGGATAACCG 5580 

TATTACCGCC TTTGAGTGAG CTGATACCGC TCGCCGCAGC CGAACGACCG AGCGCAGCGA 5640 

GTCAGTGAGC GAGGAAGCGG AAGAGCGCCC AAT AC GC AAA CCGCCTCTCC CCGCGCGTTG 57 00 

GCCGATTCAT TAATGCAGCT GGC AC GAC AG GTTTCCCGAC TGGAAAGCGG GCAGTGAGCG 57 60 

CAACGCAATT AATGTGAGTT AGCTCACTCA TTAGGCACCC CAGGCTTTAC ACTTTATGCT 582 0 

TCCGGCTCGT ATGTTGTGTG GAATTGTGAG CGGATAACAA TTTCACACAG GAAACAGCTA 5880 

TGACCATGAT TACGCCA 5897 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6898 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

AGCTTGCATG CCTGCAGTGC AGCGTGACCC GGTCGTGCCC CTCTCTAGAG ATAATGAGCA 60 

TTGCATGTCT AAGTTATAAA AAATTAC C AC ATATTTTTTT TGTCACACTT GTTTGAAGTG 120 

CAGTTTATCT ATCTTTATAC ATATATTTAA ACTTTAATCT ACGAATAATA TAATCTATAG 180 

TACTACAATA ATATCAGTGT TTTAGAGAAT CATATAAATG AACAGTTAGA CATGGTCTAA 240 

AGGACAATTG AGTATTTTGA CAACAGGACT CTACAGTTTT ATCTTTTTAG TGTGCATGTG 300 

TTCTCCTTTT TTTTTGCAAA TAGCTTCACC TATATAATAC TTCATCCATT TTATTAGTAC 3 60 

ATCCATTTAG GGTTTAGGGT TAATGGTTTT TATAGACTAA TTTTTTTAGT ACATCTATTT 42 0 

TATTCTATTT TAGCCTCTAA ATTAAGAAAA CTAAAACTCT ATTTTAGTTT TTTTATTTAA 480 

TAATTTAGAT ATAAAATAGA ATAAAATAAA GTGACTAAAA ATTAAACAAA TACCCTTTAA 540 

GAAATTAAAA AAACTAAGGA AACATTTTTC TTGTTTCGAG TAGATAATGC CAGCCTGTTA 600 

AACGCCGTCG ACGAGTCTAA CGGACACCAA CCAGCGAACC AGCAGCGTCG CGTCGGGCCA 660 

AGCGAAGCAG ACGGCACGGC ATCTCTGTCG CTGCCTCTGG ACCCCTCTCG AGAGTTCCGC 720 
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TCCACCGTTG GACTTGCTCC GCTGTCGGCA TCCAGAAATT GCGTGGCGGA GCGGCAGACG 780 

TGAGCCGGCA CGGCAGGCGG CCTCCTCCTC CTCTCACGGC ACGGCAGCTA CGGGGGATTC 840 

CTTTCCCACC GCTCCTTCGC TTTCCCTTCC TCGCCCGCCG TAATAAATAG ACACCCCCTC 900 

CACACCCTCT TTCCCCAACC TCGTGTTGTT CGGAGCGCAC ACACACACAA CCAGATCTCC 960 

CCCAAATCCA CCCGTCGGCA CCTCCGCTTC AAGGTACGCC GCTCGTCCTC CCCCCCCCCC 1020 

CCTCTCTACC TTCTCTAGAT CGGCGTTCCG GTCCATGCAT GGTTAGGGCC CGGTAGTTCT 1080 

ACTTCTGTTC ATGTTTGTGT TAGATCCGTG TTTGTGTTAG ATCCGTGCTG CTAGCGTTCG 1140 

TACACGGATG CGACCTGTAC GTCAGACACG TTC TGATTGC TAACTTGCCA GTGTTTCTCT 1200 

TTGGGGAATC CTGGGATGGC TCTAGCCGTT CCGCAGACGG GATC GATTTC ATGATTTTTT 12 60 

TTGTTTCGTT GCATAGGGTT TGGTTTGCCC TTTTCCTTTA TTTCAATATA TGCCGTGCAC 132 0 

TTGTTTGTCG GGTCATCTTT TCATGCTTTT TTTTGTCTTG GTTGTGATGA TGTGGTCTGG 1380 

TTGGGCGGTC GTTCTAGATC GGAGTAGAAT TCTGTTTCAA ACTACCTGGT GGATTTATTA 1440 

ATTTTGGATC TGTATGTGTG TGCCATACAT ATTCATAGTT ACGAATTGAA GATGATGGAT 1500 

GGAAATATCG ATCTAGGATA GGTATACATG TTGATGCGGG TTTTACTGAT GCATATACAG 1560 

AGATGCTTTT TGTTCGCTTG GTTGTGATGA TGTGGTGTGG TTGGGCGGTC GTTCATTCGT 1620 

TCTAGATCGG AGTAGAATAC TGTTTCAAAC TACCTGGTGT ATTTATTAAT TTTGGAACTG 1680 

TATGTGTGTG TCATACATCT TCATAGTTAC GAGTTTAAGA TGGATGGAAA TATCGATCTA 1740 

GGATAGGTAT ACATGTTGAT GTGGGTTTTA CTGATGCATA TACATGATGG CATATGCAGC 1800 

ATCTATTCAT ATGCTCTAAC CTTGAGTACC TATCTATTAT AATAAACAAG TATGTTTTAT 1860 

AATTATTTTG ATCTTGATAT ACTTGGATGA TGGCATATGC AGCAGCTATA TGTGGATTTT 1920 

TTTAGCCCTG CCTTCATACG CTATTTATTT GCTTGGTACT GTTTCTTTTG TCGATGCTCA 1980 

CCCTGTTGTT TGGTGTTACT TCTGCAGGGT ACCCCCGGGG TCGACCATGG TCCGTCCTGT 2040 

AGAAACCCCA ACCCGTGAAA TCAAAAAACT CGACGGCCTG TGGGCATTCA GTCTGGATCG 2100 

CGAAAACTGT GGAATTGATC AGCGTTGGTG GGAAAGCGCG TTACAAGAAA GCCGGGCAAT 2160 

TGCTGTGCCA GGCAGTTTTA ACGATCAGTT CGCCGATGCA GATATTCGTA ATTATGCGGG 222 0 

CAACGTCTGG TATCAGCGCG AAGTCTTTAT ACCGAAAGGT TGGGCAGGCC AGCGTATCGT 2280 

GCTGCGTTTC GATGCGGTCA CTCATTACGG CAAAGTGTGG GTCAATAATC AGGAAGTGAT 2340 

GGAGCATCAG GGCGGCTATA CGCCATTTGA AGCCGATGTC ACGCCGTATG TTATTGCCGG 2400 
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GAAAAGTGTA CGTATCACCG TTTGTGTGAA CAACGAACTG AACTGGCAGA CTATCCCGCC 2460 

GGGAATGGTG ATTACC GACG AAAACGGCAA GAAAAAGCAG TCTTACTTCC ATGATTTCTT 2520 

TAACTATGCC GGAATCCATC GCAGCGTAAT GCTCTACACC ACGCCGAACA CCTGGGTGGA 2 580 

CGATATCACC GTGGTGACGC ATGTCGCGCA AGACTGTAAC CACGCGTCTG TTGACTGGCA 2640 

GGTGGTGGCC AATGGTGATG TCAGCGTTGA ACTGCGTGAT GCGGATCAAC AGGTGGTTGC 2700 

AACTGGACAA GGCACTAGCG GGACTTTGCA AGTGGTGAAT CCGCACCTCT GGCAACCGGG 2760 

TGAAGGTTAT CTCTATGAAC TGTGCGTCAC AGCCAAAAGC CAGACAGAGT GTGATATCTA 2 820 

CCCGCTTCGC GTCGGCATCC GGTCAGTGGC AGTGAAGGGC GAACAGTTCC TGATTAACCA 2880 

CAAACCGTTC TACTTTACTG GCTTTGGTCG TCATGAAGAT GCGGACTTAC GTGGCAAAGG 2940 

ATTCGATAAC GTGCTGATGG TGCACGACCA CGCATTAATG GACTGGATTG GGGCCAACTC 3000 

CTACCGTACC TCGC ATTACC CTTACGCTGA AGAGATGCTC GACTGGGCAG ATGAACATGG 3060 

CATCGTGGTG ATTGATGAAA CTGCTGCTGT CGGCTTTAAC CTCTCTTTAG GCATTGGTTT 3120 

CGAAGCGGGC AACAAGCCGA AAGAACTGTA CAGCGAAGAG GCAGTCAACG GGGAAAC TC A 3180 

GCAAGCGCAC TTACAGGCGA TTAAAGAGCT GATAGCGCGT GACAAAAACC ACCCAAGCGT 3240 

GGTGATGTGG AGTATTGCCA ACGAACCGGA TACCCGTCCG CAAGTGCACG GGAATATTTC 3300 

GCCACTGGCG GAAGCAACGC GTAAACTCGA CCCGACGCGT CCGATCACCT GCGTCAATGT 33 60 

AATGTTCTGC GACGCTCACA CCGATACCAT CAGCGATCTC TTTGATGTGC TGTGCCTGAA 3420 

CCGTTATTAC GGATGGTATG TCCAAAGCGG CGATTTGGAA ACGGCAGAGA AGGTACTGGA 3480 

AAAAGAACTT CTGGCCTGGC AGGAGAAACT GCATCAGCCG ATTATCATCA CCGAATACGG 3540 

CGTGGATACG TTAGCCGGGC TGCACTCAAT GTACACCGAC ATGTGGAGTG AAGAGTATCA 3600 

GTGTGCATGG CTGGATATGT ATCACCGCGT CTTTGATCGC GTCAGCGCCG TCGTCGGTGA 3660 

ACAGGTATGG AATTTCGCCG ATTTTGCGAC CTCGCAAGGC ATATTGCGCG TTGGCGGTAA 3720 

CAAGAAAGGG ATCTTCACTC GCGACCGCAA ACCGAAGTCG GCGGCTTTTC TGCTGCAAAA 3780 

ACGCTGGACT GGCATGAACT TCGGTGAAAA AC CGC AGC AG GGAGGCAAAC AATGAATCAA 3840 

CAACTCTCCT GGCGCACCAT CGTCGGCTAC AGCCTCGGTG GGGAATTGGA GAGCTCTGAG 3900 

GGCACTGAAG TCGCTTGATG TGCTGAATTG TTTGTGATGT TGGTGGCGTA TTTTGTTTAA 3960 

ATAAGTAAGC ATGGCTGTGA TTTTATCATA TGATCGATCT TTGGGGTTTT ATTTAACACA 4020 

TTGTAAAATG TGTATCTATT AATAACTCAA TGTATAAGAT GTGTTCATTC TTCGGTTGCC 4080 

ATAGATCTGC TTATTTGACC TGTGATGTTT TGACTCCAAA AACCAAAATC ACAACTCAAT 4140 
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AAACTCATGG AATATGTCCA CCTGTTTCTT GAAGAGTTCA TCTACCATTC CAGTTGGCAT 42 00 

TTATCAGTGT TGCAGCGGCG CTGTGCTTTG TAACATAACA ATTGTTCACG GCATATATCC 42 60 

AAGAATTCAC TGGCCGTCGT TTTACAACGT CGTGACTGGG AAAACCCTGG CGTTACCCAA 4320 

CTTAATCGCC TTGCAGCACA TCCCCCTTTC GCCAGCTGGC GTAATAGCGA AGAGGCCCGC 43 80 

ACCGATCGCC CTTCCCAACA GTTGCGCAGC CTGAATGGCG AATGGCGCCT GATGCGGTAT 4440 

TTTCTCCTTA CGCATCTGTG CGGTATTTCA CACCGCATAT GGTGCACTCT CAGTACAATC 4500 

TGCTCTGATG CCGCATAGTT AAGCCAGCCC CGACACCCGC CAACACCCGC TGACGCGCCC 45 60 

TGACGGGCTT GTCTGCTCCC GGCATCCGCT TACAGACAAG CTGTGACCGT CTCCGGGAGC 462 0 

TGCATGTGTC AGAGGTTTTC ACCGTCATCA CCGAAACGCG C GAG AC G AAA GGGCCTCGTG 4680 

ATACGCCTAT TTTTATAGGT TAATGTCATG ATAATAATGG TTTCTTAGAC GTCAGGTGGC 4740 

ACTTTTCGGG GAAATGTGCG CGGAACCCCT ATTTGTTTAT TTTTCTAAAT ACATTCAAAT 4800 

ATGTATCCGC TCATGAGACA ATAACCCTGA TAAATGCTTC AATAATATTG AAAAAGGAAG 4860 

AGTATGAGTA TTCAACATTT CCGTGTCGCC CTTATTCCCT TTTTTGCGGC ATTTTGCCTT 4920 

CCTGTTTTTG CTCACCCAGA AACGCTGGTG AAAGTAAAAG ATGCTGAAGA TCAGTTGGGT 4980 

GCACGAGTGG GTTACATCGA ACTGGATCTC AACAGCGGTA AGATCCTTGA GAGTTTTCGC 5040 

CCC GAAGAAC GTTTTCCAAT GATGAGCACT TTTAAAGTTC TGCTATGTGG CGCGGTATTA 5100 

TCCCGTATTG ACGCCGGGCA AGAGCAACTC GGTCGCCGCA TACACTATTC TCAGAATGAC 5160 

TTGGTTGAGT ACTCACCAGT CACAGAAAAG CATCTTACGG ATGGCATGAC AGTAAGAGAA 5220 

TTATGCAGTG CTGCCATAAC CATGAGTGAT AACACTGCGG CCAACTTACT TCTGACAACG 5280 

ATCGGAGGAC CGAAGGAGCT AACCGCTTTT TTGCACAACA TGGGGGATCA TGTAACTCGC 5340 
CTTGATCGTT GGGAACCGGA GCTGAATGAA GCCATACCAA ACGACGAGCG TGACACCACG ' 5400 

ATGCC TGT AG CAATGGCAAC AACGTTGCGC AAACTATTAA CTGGCGAACT ACTTACTCTA 5460 

GCTTCCCGGC AACAATTAAT AGACTGGATG GAGGCGGATA AAGTTGCAGG ACCACTTCTG 5520 

CGCTCGGCCC TTCCGGCTGG CTGGTTTATT GCTGATAAAT CTGGAGCCGG TGAGC GTGGG 5580 

TCTCGCGGTA TCATTGCAGC ACTGGGGCCA GATGGTAAGC CCTCCCGTAT CGTAGTTATC 5640 

TACACGACGG GGAGTCAGGC AACTATGGAT GAACGAAATA GACAGATCGC TGAGATAGGT 5700 

GCCTCACTGA TTAAGCATTG GTAACTGTCA GACCAAGTTT ACTCATATAT ACTTTAGATT 5760 

GATTTAAAAC TTCATTTTTA ATTTAAAAGG ATCTAGGTGA AGATCCTTTT TGATAATCTC 5820 
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ATGACCAAAA TCCCTTAACG TGAGTTTTCG TTCCACTGAG CGTCAGACCC CGTAGAAAAG 5880 

ATCAAAGGAT CTTCTTGAGA TCCTTTTTTT CTGCGCGTAA TCTGCTGCTT GCAAACAAAA 5940 

AAACCACCGC TACCAGCGGT GGTTTGTTTG CCGGATCAAG . AGCTACCAAC TCTTTTTCCG 6000 

AAGGTAACTG GCTTCAGCAG AGCGCAGATA CCAAATACTG TCCTTCTAGT GTAGCCGTAG 6060 

TTAGGCCACC ACTTCAAGAA CTCTGTAGCA CCGCCTACAT ACCTCGCTCT GCTAATCCTG 6120 

TTACCAGTGG CTGCTGCCAG TGGCGATAAG TCGTGTCTTA CCGGGTTGGA CTCAAGACGA 6180 

TAGTTACCGG ATAAGGCGCA GCGGTCGGGC TGAAC GGGGG GTTCGTGCAC ACAGCCCAGC 6240 

TTGGAGCGAA CGACCTACAC CGAACTGAGA TACCTACAGC GTGAGCATTG AGAAAGC GCC 6300 

ACGCTTCCCG AAGGGAGAAA GGCGGACAGG TATCCGGTAA GCGGCAGGGT CGGAACAGGA 6360 

GAGCGCACGA GGGAGCTTCC AGGGGGAAAC GCCTGGTATC TTTATAGTCC TGTCGGGTTT 642 0 

CGCCACCTCT GACTTGAGCG TCGATTTTTG TGATGCTCGT CAGGGGGGCG GAGCCTATGG 6480 

AAAAACGCCA GCAACGCGGC CTTTTTACGG TTCCTGGCCT TTTGCTGGCC TTTTGCTCAC 6540 

ATGTTCTTTC CTGCGTTATC CCCTGATTCT GTGGATAACC GTATTAC CGC CTTTGAGTGA 6600 

GCTGATACCG CTCGCCGCAG CCGAACGACC GAGCGCAGCG AGTCAGTGAG CGAGGAAGCG 6660 

GAAGAGCGCC CAATACGCAA ACCGCCTCTC CCCGCGCGTT GGCCGATTCA TTAATGCAGC 6720 

TGGCACGACA GGTTTCCCGA CTGGAAAGCG GGCAGTGAGC GCAACGCAAT TAATGTGAGT 6780 

TAGCTCACTC ATTAGGCACC CCAGGCTTTA CACTTTATGC TTCCGGCTCG TATGTTGTGT 6840 

GGAATTGTGA GCGGATAACA ATTTCACACA GGAAACAGCT ATGACCATGA TTACGCCA 6898 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

CAGATCTGCA GATCTGCATG GGCGATG 27 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GGGGACTCTA GAGGATCCCC GGGTGGTCAG TCCCTT 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GAATTTCCCC 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GATCCGGATC CG 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
TCGACGGATC CG 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

GGGGACTCTA GAGGATCCCG AATTTCCCC 29 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

GATCCAGCTG AAGGCTCGAC AAGGCAGATC CACGGAGGAG CTGATATTTG GTGGACA 57 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

AGCTTGTCCA CCAAATATCA GCTCCTCCGT GGATCTGCCT TGTCCAGCCT TCAGCTG 57 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

AGCTGTGGAT AGGAGCAACC CTATCCCTAA TATACCAGCA CCACCAAGTC AGGGCAATCC 60 
CGGG 64 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

TCGACCCGGG ATTGCCCTGA CTTGGTGGTG CTGGTATATT AGGGATAGGG TTGCTCCTAT 60 
CCAC 64 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

CCGGGCCATT TGTTCCAGGC ACGGGATAAG CATTCAGCCA TGGGATATCA AGCTTGGATC 60 
CC 62 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

TCGAGGGATC CAAGCTTGAT ATCCCATGGC TGAATGCTTA TCCCGTGCCT GGAACAAATG 60 
GC 62 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

GATATCAAGC TTGGATCCC 19 
(2) INFORMATION FOR SEQ ID NO: 41: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 

CGGTACCTCG AGTTAAC 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 
CATGGTTAAC TCGAGGTACC GAGCT 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 
ATCTGCATGG GTG 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 
GGGGACTCTA GAGGATCCAG 
(2) INFORMATION FOR SEQ ID NO: 45: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
GTTAACTCGA GGTACCGAGC TCGAATTTCC CC 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 
GAGTTCAGGC TTTTTCATAG CT 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
AGATCTCGTG AGATAATGAA AAAG 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

ACTCGCCGAT AGTGGAAACC GACGCCCCAG CACTCGTCCG AGGGCAAAGG AATAGTAAGA 
GCTCGG 

(2) INFORMATION FOR SEQ ID NO: 49: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

GATCCCGAGC TCTTACTATT CCTTTGCCCT CGGACGAGTG CTGGGGCGTC GGTTTCCACT 
ATCGGC GAGT 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

CTGCAGGCCG GCCTTAATTA AGCGGCCGCG TTTAAACGCC CGGGCATTTA AATGGCGCGC 
CGCGATCGCT TGCAGATCTG CATGGGTG 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
GAC GGATCTG 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
TGAGATCTGA GCTCGAATTT CCCC 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GGTACCCCCG GGGTCGACCA TGG 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
GGGAATTGGA GCTCGAATTT CCCC 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
GGGAAATTAA GCTT 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

AGCGGCCGCA TTCCCGGGAA GCTTGCATGC CTGCAGAGAT CCGGTACCCG GGGATCCTCT 
AGAGTCGAC 

(2) INFORMATION FOR SEQ ID NO:57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

GGTACCCCCG GGGTCGACCA TGGTTAACTC GAGGTACCGA GCTCGAATTT CCCC 
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
GGGAATTGGT TTAAACGCGG CCGCTT 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

CCATGCATGG 10 
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